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TITLE OF THE INVENTION 
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FIELD OF THE INVENTION 

25 The present invention relates to an isolated nucleic acid 

molecule (polynucleotide) which encodes a novel human uncoupling 
protein, referred to throughout as uncoupling protein 3 (UCP3). The 
present invention also relates to recombinant vectors and recombinant 
hosts which contain a DNA fragment encoding human UCP3, 

30 substantially purfied forms of associated human UCP3, human UCP3 
mutant proteins, and methods associated with identifying compounds 
which modulate energy expenditure and body weight regulation, as well 
as affecting diseases such as obesity and diabetes. 

35 
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BACKGROUND OF THE INVENTION 

The balancing of energy expenditure and food intake 
plays a critical role in the control of body weight. Heat generation from 
uncoupling activity in mitochondria is responsible for a significant 

5 portion of energy expenditure, and therefore regulation of uncoupling 
activities provides a direct target for the treatment of obesity. 

It is known that uncoupled mitochondrial respiration in 
brown adipose tissue (BAT) plays an important role in the regulation of 
energy balance in rodents. 

10 Bouillaud et al. (1986, J. Biol. Chem.. 261(4): 1487-1490) 

disclose a rat cDNA clone which encodes rat uncoupling protein 1 
(UCP1). Rat UCP1 is a mitochondrial inner membrane protein 
synthesized in the cytosol from a nuclear encoded transcript which is 
expressed in brown adipose tissue. 

15 Bouillaud et al. (1988, Biochem. Biophys Res. Comm. 157: 

783-792) disclose a genomic clone encoding rat UCP1. 

Nicholls and Locke (1984, Physiol. Rev. 64: 1-64) review 
UCP1, noting that the rat 32 kD protein forms a proton channel through 
the mitochondrial inner membrane and is active in uncoupling ATP 

20 synthesis from heat production in BAT. 

Jacobsson et al. (1985, J. Biol. Chem. 260: 16250-16254) 
disclose cDNA clones for mENA isolated from mouse brown adipose 
tissue. Mouse ucpl mRNA was shown to be induced in brown adipose 
tissue by exposure to cold. 

25 Kozak et al. (1988, J. Biol. Chem. 263: 12274-12277) disclose 

the genomic clone encoding mouse UCPl. 

Cassard et al. (1990, J. Cell. Biochem. 43: 255-264) discloses 
the genomic clone for human ucpl, as well as the deduced amino acid 
sequence for human UCPl. The authors show that rat and human 

30 UCPl are 79% homologous at both the nucleotide and amino acid level. 

UCPl activity in thermogenesis and uncoupled energy 
dissipation is limited to brown adipose tissue. Therefore, it is not 
expected that UCPl is actively involved in the cause and effect of body 
weight indications such as obesity and diabetes in vertebrates such as 

35 humans, which contain limited amount of brown fat. 
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Fleury et al. (1997, Nature Genetics 15: 269-272) disclose a 
gene encoding both a mouse and human mitochondrial uncoupling 
protein, designated mouse UCP2 and human UCP2, respectively. The 
deduced human amino acid sequence encodes a 33 kD protein which is 

5 approximately 95% homologous to the mouse protein at the amino acid 
level. As with human UCP1, human UCP2 comprises 3 mitochondrial 
carrier protein motifs and an ATP binding site. The authors show a 
wide range of tissue specific expression of the human ucp2 gene, 
including skeletal muscle, lung, heart, placenta, stomach, as well as 

10 immune systems tissue such as spleen, thymus, leukocytes, 

macrophages and bone marrow. The authors mapped the human ucp2 
gene to chromosome 11, which has been linked to obesity and 
hyperinsulinaemia in the mouse. 

It would be advantageous to identify a gene encoding an 

15 additional human uncoupling protein wherein expression is for the 
most part limited to skeletal muscle. A nucleic acid molecule 
expressing an additional human uncoupling protein in such a specific 
manner would be extremely useful in screening for compounds acting 
as a modulator of to obesity and hyperinsulinaemia. Such a compound 

20 or compounds will be useful in controlling obesity as well as deleterious 
indications associated with obesity, such as diabetes. Additionally, such 
a nucleic acid molecule will be useful in gene therapy applications to 
overcome the deleterious effects of obesity and obesity-related 
complications, such as diabetes. The present invention addresses and 

25 meets this need. 

SUMMARY OF THE INVENTION 

The present invention relates to a purified or isolated 
nucleic acid molecule (polynucleotide) which encodes a novel vertebrate 
30 uncoupling protein. 

The present invention relates to a purified or isolated 
nucleic acid molecule (polynucleotide) which encodes a novel human 
uncoupling protein. 

The present invention also relates to a purified or 
35 isolated nucleic acid molecule which encodes a human uncoupling 
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protein which is predominantly expressed within human skeletal 
muscle. 

A preferred aspect of the present invention is disclosed 
in Figure 1A-1B and SEQ ID NO:ll, a purified human cDNA 

5 encoding a novel uncoupling protein, UCP3. 

Another specific embodiment of the present invention 
relates to isolated biologically active fragments or mutants of a 
nucleic acid molecule encoding human uncoupling protein 3, 
disclosed herein in Figure 2A-2D, Figure 3 and SEQ ID NO:12. Any 

10 such polynucleotide includes but is not necessarily limited to 
nucleotide substitutions, deletions, additions, amino-terminal 
truncations and carboxy-terminal truncations such that these 
mutations encode mRNA which express a protein or protein 
fragment of diagnostic, therapeutic or prophylactic use. 

15 The present invention relates to a purified or isolated 

nucleic acid molecule (polynucleotide) which encodes a novel mouse 
uncoupling protein, as set forth as SEQ ID NO: 17. 

The present invention also relates to a purified or 
isolated nucleic acid molecule which encodes a mouse uncoupling 

20 protein which is predominantly expressed within human skeletal 
muscle, the expressed protein as set forth in SEQ ID NO:18. 

The purified nucleic acid molecule of the present 
invention may include a deoxyribonucleic acid molecule (DNA), such 
as genomic DNA and complementary DNA (cDNA), which may be 

25 single (coding or noncoding strand) or double stranded, as well as 
synthetic DNA, such as a synthesized, single stranded 
polynucleotide. The isolated nucleic acid molecule of the present 
invention may also include a ribonucleic acid molecule (RNA). 

The present invention also relates to recombinant 

30 vectors and recombinant hosts, both prokaryotic and eukaryotic, 
which contain the substantially purified mammalian nucleic acid 
molecules disclosed throughout this specification. 

The present invention also relates to subcellular 
membrane fractions of the recombinant host cells (both prokaryotic 

35 and eukaryotic as well as both stably and transiently transformed 
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cells) comprising the nucleic acids of the present invention. These 
subcellular membrane fractions will comprise UCP3 at levels 
substantially above wild type levels and hence will be useful in 
various assays described throughout this specification. 

5 The present invention also relates to a substantially 

purified human uncoupling protein wherein the native form is 
substantially localized within mitochondria of skeletal muscle. 

A preferred aspect of the present invention is disclosed 
in Figure 2 and SEQ ID NO: 12, the amino acid sequence of the 

10 exemplified human uncoupling protein, hUCP3. 

Another preferred aspect of the present invention is 
disclosed in Figure 7 and SEQ ID NO: 18, the amino acid sequence of 
the exemplified mouse uncoupling protein, mUCP3. 

The present invention also relates to biologically active 

15 fragments and/or mutants of a novel human uncoupling protein, 
including but not necessarily limited to amino acid substitutions, 
deletions, additions, amino terminal truncations and carboxy- 
terminal truncations such that these mutations provide for proteins 
or protein fragments of diagnostic, therapeutic or prophylactic use. 

20 The present invention also relates to methods of expressing the 

UCP proteins or protein fragments disclosed herein, assays employing 
these human UCPs, cells expressing these UCPs, and compounds 
identified through the use of these UCPs, including modulators of 
energy expenditure and body weight regulation, either through direct 

25 contact with the mitochondrial uncoupling protein, or a compound 
which acts in trans to modulate UCP expression. Such modulators 
identified in this process are useful as therapeutic agents for controlling 
obesity, diabetes and other related diseases. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1A and Figure IB show the nucleotide sequence 
(SEQ ID NO:ll) which comprises the full-length cDNA encoding human 
uncoupling protein 3. 
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Figure 2A, Figure 2B, Figure 2C and Figure 2D show the 
translation of the open reading frame (SEQ ID NO:12) of the full-length 
human uncoupling protein 3 nucleotide sequence. 

Figure 3 shows the amino acid sequence of human uncoupling 
5 protein 3 (SEQ ID NO:12). 

Figure 4A, Figure 4B and Figure 4C show Northern blot analysis 
of human uncoupling protein 3 in human tissue. 

Figure 5 shows the orientation of vector and gene specific primers 
used to screen a human fetal brain cDNA library for a full-length 
10 human UCP3 cDNA. 

Figure 6A and 6B shows the nucleotide sequence (SEQ ID NO: 17) 
of a cDNA which encodes the full-length mouse uncoupling protein 3. 

Figure 7 shows the ammo acid sequence (SEQ ID NO: 18) of the 
mouse uncoupling protein 3. 
15 Figure 8 A, Figure 8B and Figure 8C show flow cytometry analysis 

of mitochondria membrane potential with UCP3 expression in yeast. 
TheS. cerevisiae strain W303 containing vector alone (Figure 8A) or 
UCP3 expression plasmids (Figure 8B and Figure 8C) were stained with 
the potential sensitive dye, DiOC6 and analyzed by FACS. The X-axis 
20 represents the intensity of fluorescence on logarithmic scale while the 
Y-axis indicates the number of cells. a +gaF indicates that cells were 
induced with galactose for 5 hours. "-gaT indicates cells were 
maintained in raffinose media without the induction by galactose. A 
decrease in fluorescence intensity indicates a reduction in mitochondria 
25 membrane potential. 

Figure 9A and Figure 9B shows the response of UCP3 
expression to adenovirus-mediated leptin treatment. Figure 9A shows 
measurement of weight loss* Y-axis indicates the average weight 
expressed in percentage of weight releative to the day of injection (Day 0). 
30 Figure 9B shows UCP3 mRNA level. Y-axis indicates the relative level 
of UCP3 mRNA (the average of the Adb-gal and control animals is 
assigned a value of 1). Standard deviation for each group is as marked. 
Ad-Leptin: first generation El-deleted, replication deficient adenoviral 
vector expressing leptin; HD-leptin: helper-dependent adenoviral vector 
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expressing leptin; Adb-gal: first generation adenoviral vector expressing 
b-galactosidase; Control: untreated. 

DETAILED DESCRIPTION OF THE INVENTION 
5 The present invention relates to isolated nucleic acid 

and protein forms which represent a vertebrate uncoupling protein, 
preferably human and mouse forms. The isolated genes and 
concomitant translation products which make up a portion of the 
present invention may be used in various ways in the treatment of 

10 obesity and other related diseases, including but not limited to diabetes. 
The uncoupling protein of the present invention is a protein which is 
translated in the cytosol from a nuclear encoded mKNA transcript. 
This uncoupling protein is presumably imported into the mitochondrial 
inner membrane post-translationally. 

15 The process of oxidative phosphorylation pumps protons to 

the outside of the inner membrane of the mitochondria, leading to the 
generation of a membrane potential across the inner membrane. This 
membrane potential is then used to drive ATP synthesis. The purified 
uncoupling proteins of the present invention have the ability to uncouple 

20 the protonmotive force associated with oxidative phosphorylation, 

resulting in the concomitant release of large amounts of energy. It will 
be within the scope of the present invention to use the herein disclosed 
nucleic acids to construct recombinant vectors and recombinant host 
cells wherein the human UCP3 gene is expressed, either stably or 

25 transiently. Such transformed recombinant cell lines will be useful in 
screening for modulators of UCP3 activity, and hence, as effectors of 
energy expenditure and body weight regulation. The isolated nucleic 
acid fragments of the present invention will also be useful in gene 
therapy applications regarding therapeutic treatment of obesity and 

30 obesity related indications, including but not limited to diabetes, as well 
as conditions such as mitochondrial-associated hypermetabolism. 

To this end, the present invention relates to a purified or 
isolated nucleic acid molecule (polynucleotide) which encodes a novel 
human uncoupling protein. 
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The present invention also relates to a purified or 
isolated nucleic acid molecule which encodes a human uncoupling 
protein which is predominantly expressed within human skeletal 
muscle. 

5 A preferred aspect of the present invention is disclosed 

in Figure 1 and SEQ ID NO:ll, a purified human cDNA encoding a 
novel uncoupling protein, UCP3, which is as follows: 

TCGAACTCAC TCACCTCCCC TCTCACCTCA CTGCCCTCAC CAGCCAGCCT 
CTTGTCAAGT GATCAGGCTG TCAACCAACT TCTCTAGGAT AAGGTTTCAG 

10 GTCAGCCTGT GTGTATAAGA CCAGTGCCAA GCCAGAAGCA GCAGAGACAA 
CAGTGAATGA CAAGGAGGGG CCATCCAATC CCTGCTGCCA CCTCCTGGGA 
TGGAGCCCTA GGGAGCCCCT GTGCTGCCCC TGCCGTGGCA GGACTCACAG 
CCCCACCGCT GCACTGAAGC CCAGGGCTGT GGAGCAGCTC TCTCCTTGGA 
CTCCTCTCGG CCCTAAAGGG ACTGGGCAGA GCCTTCCAGG ACTATGGTTG 

15 GACTGAAGCC TTCAGACGTG CCTCCCACCA TGGCTGTGAA GTTCCTGGGG 
GCAGGCACAG CAGCCTGTTT TGCTGACCTC GTTACCTTTC CACTGGACAC 
AGCCAAGGTC CGCCTGCAGA TCCAGGGGGA GAACCAGGCG GTCCAGACGG 
CCCGGCTCGT GCAGTACCGT GGCGTGCTGG GCACCATCCT GACCATGGTG 
CGGACTGAGG GTCCCTGCAG CCCCTACAAT GGGCTGGTGG CCGGCCTGCA 

20 GCGCCAGATG AGCTTCGCCT CCATCCGCAT CGGCCTTTAC GACTCCGTCA 
AGCAGGTGTA CACCCCCAAA GGCGCGGACA ACTCCAGCCT CACTACCCGG 
ATTTTGGCCG QCTGCACCAC AGGAGCCATG GCGGTGACCT GTGCCCAGCC 
CACAGATGTG GTGAAGGTCC GATTTCAGGC CAGCATACAC CTCGGGCCAT 
CCAGGAGCGA CAGAAAATAC AGCGGGACTA TGGACGCCTA CAGAACCATC 

25 GCCAGGGAGG AAGGAGTCAG GGGCCTGTGG AAAGGAACTT TGCCCAACAT 
CATGAGGAAT GCTATCGTCA ACTGTGCTGA GGTGGTGACC TACGACATCC 
TCAAGGAGAA GCTGCTGGAC TATCACCTGC TCACTGACAA CTTCCCCTGC 
CACTTTGTCT CTGCCTTTGG AGCCGGCTTC TGTGCCACAG TGGTGGCCTC 
CCCGGTGGAC GTGGTGAAGA CCCGGTATAT GAACTCACCT CCAGGCCAGT 

30 ACTTCAGCCC CCTCGACTGT ATGATAAAGA TGGTGGCCCA GGAGGGCCCC 
ACAGCCTTCT ACAAGGGATT TACACCCTCC TTTTTGCGTT TGGGATCCTG 
GAACGTGGTG ATGTTCGTAA CCTATGAGCA GCTGAAACGG GCCCTGATGA 
AAGTCCAGAT GTTACGGGAA TCACCGTTTT GAACAAGACA AGAAGGCCAC 
TGGTAGCTAA CGTGTCCGAA ACCAGTTAAG AATGGAAGAA AACGGTGCAT 

35 CCACGCACAC ATGGACACAG ACCCACACAT GTTTACAGAA CTGTTGTTTA 
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CTTGTTGCTG ATTCAAGAAA CAGAAGTAGA AGAGAGAGGA TTCTGGTCTT 
CACTGCCATG CCTCAAGAAC ACCTTTGTTT TGCACTGACA AGATGGAAAA 
TAAATTATAT TAATTTTTGA AACCCATTAG GCATGCCTAA TATTTAGGCA 
AGAGAAAATA AACCAAGATA GATCCATTTG GACAAAATGG AAGGTTGGAG 
5 ACGTGTATCC CCGTGAAATC TGGTCAGATA ATGAATGATA AGCAGGAAGG 
ATGGCAAGCA CGGGACAGGA GGGGCCCACA ATGGAGTGGG AGATCAGCCA 
CGGAGCCTGG AGGGACGCCA CCCAGCAACA CAGAGCTGGC GACTGCAGCT 
GCACCATCAC ACATGCATCA TCAGCCTATT TGTAATATGT CTGCCACAGA 
GAGTCCTTTG GGATTCTAGG AAACCCAAGG AACAAGAGAA AAAACTAGAG 

10 CCTGTGCTAA AGAAGCCTGC TGGGCCCATG TGAGGCTGGG GTCGTAAATA 
TTCCCCGACG ACACTGAAGA ATCAAGAGGG CAGCCCCCAC TTCTCCTACA 
AAATGGAGGG AGCCATCCCT TCCCTGTCCA CCTCACCAGG GGTGCTATGA 
CATGCAAGTG AGAAGCTGGG CATGAACGCA CTTTATAAAA GCAAAAGCTC 
TGTGTAAATC TAACTACAAG GACAATGCCT TGGGAGAGAT TTTGTCGGGA 

15 CAGAGAGGAG TTGCCAGGGA AGAAGGTTTG AAAGATACGG TTGTCTAGAG 
GTGAGACCAA AGGATCCAGA GACTTGGGGA CCAGAGGTGA CAGTGGATGA 
CGTGAAGCCA CAGGAGCCCC ACCCCCATGC AGCTTTTTCC CCACCCCCCC 
CACCACGCGC TCAATCATGA GTACCTCAAA GGATTGTTGG GCTTGGGGGA 
AAAGAGGTGG ATTCCTGGGC AAGAACCTAA AGTAGCAGGA (SEQ ID NO:ll). 

20 Another specific embodiment of the present invention 

relates to isolated biologically active fragments or mutants of a 
nucleic acid molecule encoding human uncoupling protein 3, 
disclosed herein as SEQ ID NO:ll (see also Figure 1A-1B). Any such 
polynucleotide includes but is not necessarily limited to nucleotide 

25 substitutions, deletions, additions, amino-terminal truncations and 
carboxy-terminal truncations such that these mutations encode 
mRNA which express a protein or protein fragment of diagnostic, 
therapeutic or prophylactic use. 

The purified nucleic acid molecule of the present 

30 invention may include a deoxyribonucleic acid molecule (DNA), such 
as genomic DNA and complementary DNA (cDNA), which may be 
single (coding or noncoding strand) or double stranded, as well as 
synthetic DNA, such as a synthesized, single stranded 
polynucleotide. The isolated nucleic acid molecule of the present 

35 invention may also include a ribonucleic acid molecule (RNA). 
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The present invention also relates to recombinant 
vectors and recombinant hosts, both prokaryotic and eukaryotic, 
which contain the substantially purified nucleic acid molecules 
disclosed throughout this specification. 
5 The present invention also relates to a substantially 

purified human uncoupling protein wherein the native form is 
substantially localized within mitochondria of skeletal muscle. 

A preferred aspect of the present invention is disclosed 
in Figure 2A-2D, Figure 3 and SEQ ID NO: 12, the amino acid 
10 sequence of the exemplified human uncoupling protein, UCP3, 
which is as follows: 

MVGLKPSDVPPTMAWFLGAGTAACFADLWFPLDTAKVRLQIQGENQAVQTARLV 
QYRGVLGTILTMVRTEGPCSPYNGLVAGLQRQMSFASIRIGLYDSVKQVYTPKGAD 
NSSLTTRILAGCTTGAMAVTCAQ 
15 IAREEGVRGLWKGTLPNIMRNAIWC 
FGAGFC ATWAS PVDVVKTRY1W 

lrlgswnvvmfvtyeqlkralmkvqmlrespf , as set forth in three-letter 
abbreviation in SEQ ID NO:12. 

The present invention also relates to biologically active 

20 fragments and/or mutants of a novel human uncoupling protein, 
including but not necessarily limited to amino acid substitutions, 
deletions, additions, amino terminal truncations and carboxy- 
terminal truncations such that these mutations provide for proteins 
or protein fragments of diagnostic, therapeutic or prophylactic use. 

25 The present invention also relates to methods of expressing the 

vertebrate UCP proteins or protein fragments disclosed herein, assays 
employing these human and mouse UCPs, cells expressing these UCPs, 
and compounds identified through the use of these UCPs, including 
modulators of energy expenditure and body weight regulation, either 

30 through direct contact with the mitochondrial uncoupling protein, or a 
compound which acts in trans to modulate ucp expression. Such 
modulators identified in this process are useful as therapeutic agents 
for controlling obesity, diabetes and other related diseases. 

Other uses of the nucleic acid fragment and protein fragments of 

35 the present invention include, but are not limited to, identifiying trans- 
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acting factors which regulate UCP3 expression. It will then be possible 
to identify compounds which affect a protein or protein complex acting 
in trans to up or down regulate vertebrate UCP3 expression such that 
administration of such a compound or pharmaceutically acceptable salt 
5 in a pharmaceutically acceptable formulation will modulate UCP3 
expression and in turn the degree of energy expenditure within the 
patient It will be known to one of skill in the art that use of such an 
assay may proceed by construction of a prokaryotic or eukaryotic 
expression vector wherein a portion of the UPC3 5' non-coding region is 

10 fused to an appropriate reporter gene (including but not limited to p- 
galactosidase) which is utilized to quantitatively measure the effect of a 
particular compound on UPC3 gene regulation. It will also be possible 
to identify compounds through screens utilizing recombinant host cells 
stably or transiently transfected with a UPC3 encoding gene fragment. 

15 These compounds will affect, for example, the level of uncoupling of 
oxidative phosophorylation from ATP synthesis in mitochondria of 
transfected cells. 

Therefore, the present invention also relates to methods 
of expressing a nucleic acid fragment encoding UCP, including but 

20 not limited to the exemplified human and/or mouse ucp3 or gene 

fragments disclosed herein, recombinant prokaryotic and eukaryotic 
expression vectors containing such a nucleic acid fragment, 
recombinant host cells transfected with these vectors wherein a ucp 
gene or gene fragment is expressed at predetermined levels through 

25 appropriate vector construction, and compounds identified through 
the interaction with either regulatory regions of a ucp gene or a 
compound which interacts with the mature, mitochondrial-localized 
form of a UCP protein. Such a compound may be useful in treating 
disorders associated with regulation of body weight and concomitant 

30 intake and expeniture of energy (or lack thereof), including but not 
limited to obesity and diabetes. 

To this end, the present invention also relates to a 
purified or isolated nucleic acid molecule (polynucleotide) which 
encodes a novel mouse uncoupling protein, as set forth as SEQ ID 

35 NO: 17, and as follows: 
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CCAGGAACAG CAGAGACAAC AGTGAATGGT GAGGCCCGGC CGTCAGATCC 
TGCTGCTACC TAATGGAGTG GATCCTTAGG GTGGCCCTGC ACTACCCAAC 
CTTGGCTAGA CGCACAGCTT CCTCCCTGAA CTGAAGCAAA AGATTGCCAG 
GCAAGCTCTC TCCTCGGACC TCCATAGGCA GCAAAGGAAC CAGGCCCATT 
5 CCCCGGGACC ATGGTTGGAC TTCAGCCCTC CGAAGTGCCT CCCACAACGG 
TTGTGAAGTT CCTGGGGGCC GGCACTGCGG CCTGTTTTGC GGACCTCCTC 
ACTTTTCCCC TGGACACCGC CAAGGTCCGT CTGCAGATCC AAGGGGAGAA 
CCCAGGGGCT CAGAGCGTGC AGTACCGCGG TGTGCTGGGT ACCATCCTGA 
CTATGGTGCG CACAGAGGGT CCCCGCAGCC CCTACAGCGG ACTGGTCGCT 

10 GGCCTGCACC GCCAGATGAG TTTTGCCTCC ATTCGAATTG GCCTCTACGA 
CTCTGTCAAG CAGTTCTACA CCCCCAAGGG AGCGGACCAC TCCAGCGTCG 
CCATCAGGAT TCTGGCAGGC TGCACGACAG GAGCCATGGC AGTGACCTGC 
GCCCAGCCCA CGGATGTGGT GAAGGTCCGA TTTCAAGCCA TGATACGCCT 
GGGAACTGGA GGAGAGAGGA AATACAGAGG GACTATGGAT GCCTACAGAA 

15 CCATCGCCAG GGAGGAAGGA GTCAGGGGCC TGTGGAAAGG GACTTGGCCC 
AACATCACAA GAAATGCCAT TGTCAACTGT GCTGAGATGG TGACCTACGA 
CATCATCAAG GAGAAGTTGC TGGAGTCTCA CCTGTTTACT GACAACTTCC 
CCTGTCACTT TGTCTCTGCC TTTGGAGCTG GCTTCTGTGC CACAGTGGTG 
GCCTCCCCGG TGGATGTGGT AAAGACCCGA TACATGAACG CTCCCCTAGG 

20 CAGGTACCGC AGCCCTCTGC ACTGTATGCT GAAGATGGTG GCTCAGGAGG 
GACCCACGGC CTTCTACAAA GGATTTGTGC CCTCCTTTCT GCGTCTGGGA 
GCTTGGAACG TGATGATGTT TGTAACATAT GAGCAACTGA AGAGGGCCTT 
AATGAAAGTC CAGGTACTGC GGGAATCTCC GTTTTGAACA AGGCAAGCAG 
GCTGCCTGGA ACAGAACAAA GCGTCTCTGC CCTGGGGACA CAGGCCCACA 

25 CGGTCCAGAA CCCTGCACTG CTGCTGACAC GAGAAACTGA ACTAAAAGAG 
GAGAGTTTTA GTCCTCCGTG TTTCGTCCTA AAACACCTCT GTTTTGCACT 
GACCTGATGG GAAATAAATT ATATTAATTT TTAAACCCTT TCCGGTTGGA 
TGCCTAACAT TTAGGCAAGA GACAACAAAG AAAACCAGAG TCAACTCCCT 
TGAAATGTAG GAATAAAGGA TGCATAATAA ACAGGAAAGG CACAGGTTTT 

30 GAGAAGATCA GCCCACAGTG TTGTCCTTGA ATCAAACAAA ATGGTCGGAG 
GAACCCTTCG GGTTCAGCAC AAAGAGGTGA CTACAGCCTT TTGGTCACCA 
GATGACTCCG CCCCTTTGTA ATGAGTCTGC CAAGTAGACT CTATCAAGAT 
TCTGGGGAAA GGAGAAAGAA CACATTGACC TGCCCGGGCG GCCGCTCGAG 
CCCTATGA , disclosed as SEQ ID NO:17. 
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The present invention also relates to mouse forms of 
isolated biologically active fragments or mutants of a nucleic acid 
molecule encoding human uncoupling protein 3, disclosed herein as 
SEQ ID NO:17 (see also Figure 6A-6B). Any such polynucleotide 
5 includes but is not necessarily limited to nucleotide substitutions, 
deletions, additions, ammo-terminal truncations and carboxy- 
terminal truncations such that these mutations encode mRNA 
which express a protein or protein fragment of diagnostic, 
therapeutic or prophylactic use. 

10 The purified mouse nucleic acid molecules of the 

present invention may include a deoxyribonucleic acid molecule 
(DNA), such as genomic DNA and complementary DNA (cDNA), 
which may be single (coding or noncoding strand) or double 
stranded, as well as synthetic DNA, such as a synthesized, single 

15 stranded polynucleotide. The isolated nucleic acid molecule of the 
present invention may also include a ribonucleic acid molecule 
(RNA). 

The present invention also relates to recombinant 
vectors and recombinant hosts, both prokaryotic and eukaryotic, 

20 which contain the substantially purified mouse nucleic acid 
molecules disclosed throughout this specification. 

The present invention also relates to a substantially 
purified mouse uncoupling protein wherein the native form is 
substantially localized within mitochondria of skeletal muscle. 

25 A preferred aspect of the present invention is disclosed 

in Figure 7 and SEQ ID NO:18, the amino acid sequence of the 
exemplified mouse uncoupling protein, UCP3, which is as follows: 

MVGLQPSEVP PTTWKFLGA GTAACFADLL TFPLDTAKVR LQIQGENPGA 
QSVQYRGVLG TILTMVRTEG PRSPYSGLVA GLHRQMSFAS IRIGLYDSVK 
30 QFYTPKGADH SSVAIRILAG CTTGAMAVTC AQPTDWKVR FQAMIRLGTG 
GERKYRGTMD AYRTIAREEG VRGLWKGTWP NITRNAIVNC AEMVTYDIIK 
EKLLESHLFT DNFPCHFVSA FGAGFCATW ASPVDWKTR YMNAPLGRYR 
SPLHCMLKMV AQEGPTAFYK GFVPSFLRLG AWNVMMFVTY EQLKRALMKV 

QVLRES PF* , as set forth in three-letter abbreviation in SEQ ID NO:18. 
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The present invention also relates to biologically active 
fragments and/or mutants of a novel mouse uncoupling protein, 
including but not necessarily limited to amino acid substitutions, 
deletions, additions, ammo terminal truncations and carboxy- 
5 terminal truncations such that these mutations provide for proteins 
or protein fragments of diagnostic, therapeutic or prophylactic use. 

It is known that there is a substantial amount of 
redundancy in the various codons which code for specific amino 
acids. Therefore, this invention is also directed to those DNA 

10 sequences which contain alternative codons which code for the 

eventual translation of the identical amino acid. For purposes of this 
specification, a sequence bearing one or more replaced codons will be 
defined as a degenerate variation. Also included within the scope of 
this invention are mutations either in the DNA sequence or the 

15 translated protein which do not substantially alter the ultimate 
physical properties of the expressed protein. For example, 
substitution of valine for leucine, arginine for lysine, or asparagine 
for glutamine may not cause a change in functionality of the 
polypeptide. 

20 It is known that DNA sequences coding for a peptide may be 

altered so as to code for a peptide having properties that are different 
than those of the naturally-occurring peptide. Methods of altering the 
DNA sequences include, but are not limited to site directed mutagenesis* 
Examples of altered properties include but are not limited to changes in 

25 the affinity of an enzyme for a substrate or a receptor for a ligand. 
As used herein, "purified* and "isolated" are utilized 
interchangibly to stand for the proposition that the nucleic acid, 
protein, or respective fragment thereof in question has been 
substantially removed from its in vivo environment so that it may be 

30 manipulated by the skilled artisan, such as but not limited to nucleotide 
sequencing, restriction digestion, site-directed mutagenesis, and 
subcloning into expression vectors for a nucleic acid fragment as well as 
obtaining the protein or protein fragment in pure quantities so as to 
afford the opportunity to generate polyclonal antibodies, monoclononal 

35 antibodies, amino acid sequencing, and peptide digestion. Therefore, 
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the nucleic acids claimed herein may be present in whole cells or in cell 
lysates or in a partially purified or substantially purified form. A 
nucleic acid is considered substantially purified when it is purified away 
from environmental contaminants. Thus, a nucleic acid sequence 
5 isolated from cells is considered to be substantially purified when 
purified from cellular components by standard methods while a 
chemically synthesized nucleic acid sequence is considered to be 
substantially purified when purified from its chemical precursors. 

As used herein, "UCP3" may refer to any vertebrate form of UCP3, 

10 including but not limited to mouse or human UCP3. 

As used herein, "BAT" means brown adipose tissue. 
As used herein, "EST" means expressed sequence tag. 

As used herein, a "biologically active equivalent* or 
"functional derivative" of a wild type UCP3 possesses a biological 

15 activity that is substantially similar to the biological activity of the 
wild type UCP3. The term "functional derivative" is intended to 
include the "fragments," "mutants," "variants," "degenerate 
variants," "analogs" and "homologues" or to "chemical derivatives" of 
the wild type UCP3 protein. The term "fragment" is meant to refer to 

20 any polypeptide subset of wild type UCP3. The term "mutant" is 

meant to refer to a molecule that may be substantially similar to the 
wild type form but possesses distinguishing biological 
characteristics. Such altered characteristics include but are in no 
way limited to altered substrate binding, altered substrate affinity 

25 and altered sensitivity to chemical compounds affecting biological 
activity of the UCP3 or UCP3 functional derivative. The term 
"variant" is meant to refer to a molecule substantially similar in 
structure and function to either the entire wild type protein or to a 
fragment thereof A molecule is "substantially similar" to a wild type 

30 UCP3-like protein if both molecules have substantially similar 
structures or if both molecules possess similar biological activity. 
Therefore, if the two molecules possess substantially similar activity, 
they are considered to be variants even if the structure of one of the 
molecules is not found in the other or even if the two amino acid 

35 sequences are not identical. 
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The term "analog" refers to a molecule substantially 
similar in function to either the entire wild type UCP3-like protein or 
to a fragment thereof. 

Any of a variety of procedures may be used to clone 
5 UCP3. These methods include, but are not limited to, (1) a RACE 
PCR cloning technique (Frohman, et aL, 1988, Proc. Natl Acad. 
ScL85: 8998-9002), 5' and/or 3' RACE may be performed to 
generate a full length cDNA sequence. This strategy involves 
using gene-specific oligonucleotide primers for PCR amplification 

10 of UCP3 cDNA. These gene-specific primers are designed 
through identification of an expressed sequence tag (EST) 
nucleotide sequence which has been identified by searching any 
number of publicly available nucleic acid and protein databases; 
(2) direct functional expression of the UCP3 cDNA following the 

15 construction of an UCP3-contaimng cDNA library in an 

appropriate expression vector system; (3) screening a UCP3- 
containing cDNA library constructed in a bacteriophage or 
plasmid shuttle vector with a labeled degenerate oligonucleotide 
probe designed from the amino acid sequence of the UCP3 protein; 

20 (4) screening a UCP3-containing cDNA library constructed in a 
bacteriophage or plasmid shuttle vector with a partial cDNA 
encoding the UCP3 protein. This partial cDNA is obtained by the 
specific PCR amplification of UCP3 DNA fragments through the 
design of degenerate oligonucleotide primers from the amino acid 

25 sequence known for other UCP3 kinases which are related to the 
UCP3 protein; (5) screening an UCP3-containing cDNA library 
constructed in a bacteriophage or plasmid shuttle vector with a 
partial cDNA encoding the UCP3 protein. This strategy may also 
involve using gene-specific oligonucleotide primers for PCR 

30 amplification of UCP3 cDNA identified as an EST as described 
above; or (6) designing 5' and 3' gene specific oligonucleotides 
using SEQ ID NOS 1 or 2 as a template so that either the full 
length cDNA may be generated by known RACE techniques, or a 
portion of the coding region may be generated by these same 
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known RACE techniques to generate and isolate a portion of the 
coding region to use as a probe to screen one of numerous types of 
cDNA and/or genomic libraries in order to isolate a full length 
version of the nucleotide sequence encoding UCP3. 
5 It is readily apparent to those skilled in the art that 

other types of libraries, as well as libraries constructed from other 
cells types or species types, may be useful for isolating a UCP3- 
encoding DNA or a UCP3 homologue. Other types of libraries 
include, but are not limited to, cDNA libraries derived from other 

10 cells or cell lines other than human cells or tissue such as 

murine cells, rodent cells or any other such vertebrate host which 
may contain a UCP3-encoding DNA. Additionally a UCP3 gene 
may be isolated by oligonucleotide- or polynucleotide- based 
hybridization screening of a vertebrate genomic library, including 

15 but not limited to a human genomic library, a murine genomic 
library and a rodent genomic library, as well as concomitant 
human genomic DNA libraries. 

It is readily apparent to those skilled in the art that 
suitable cDNA libraries may be prepared from cells or cell lines 

20 which have UCP3 activity. The selection of cells or cell lines for 
use in preparing a cDNA library to isolate a UCP3 cDNA may be 
done by first measuring cell associated UCP3 activity using any 
known assay for UCP3 activity. 

Preparation of cDNA libraries can be performed by 

25 standard techniques well known in the art. Well known cDNA 
library construction techniques can be found for example, in 
Sambrook, et aL, 1989, Molecular Cloning: A Laboratory Manual; 
Cold Spring Harbor Laboratory, Cold Spring Harbor, New York. 
Complementary DNA libraries may also be obtained from 

30 numerous commercial sources, including but not limited to 
Clontech Laboratories, Inc. and Stratagent. 

It is also readily apparent to those skilled in the art 
that DNA encoding UCP3 may also be isolated from a suitable 
genomic DNA library. Construction of genomic DNA libraries 

35 can be performed by standard techniques well known in the art. 
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Well known genomic DNA library construction techniques can be 
found in Sambrook, et aL, supra. 

In order to clone the UCP3 gene by one of the 
preferred methods, the amino acid sequence or DNA sequence of 
5 UCP3 or a homologous protein may be necessary. To accomplish 
this, the UCP3 or a homologous protein may be purified and 
partial amino acid sequence determined by automated 
sequenators. It is not necessary to determine the entire amino 
acid sequence, but the linear sequence of two regions of 6 to 8 

10 amino acids can be determined for the PCR amplification of a 
partial UCP3 DNA fragment. Once suitable amino acid 
sequences have been identified, the DNA sequences capable of 
encoding them are synthesized. Because the genetic code is 
degenerate, more than one codon may be used to encode a 

15 particular amino acid, and therefore, the amino acid sequence 
can be encoded by any of a set of similar DNA oligonucleotides. 
Only one member of the set will be identical to the UCP3 sequence 
but others in the set will be capable of hybridizing to UCP3 DNA 
even in the presence of DNA oligonucleotides with mismatches. 

20 The mismatched DNA oligonucleotides may still sufficiently 

hybridize to the UCP3 DNA to permit identification and isolation 
of UCP3 encoding DNA. Alternatively, the nucleotide sequence of 
a region of an expressed sequence may be identified by searching 
one or more available genomic databases. Gene-specific primers 

25 may be used to perform PCR amplification of a cDNA of interest 
from either a cDNA library or a population of cDNAs. As noted 
above, the appropriate nucleotide sequence for use in a PCR-based 
method may be obtained from SEQ ID NOS: 1 or 2, either for the 
purpose of isolating overlapping 5' and 3' RACE products for 

30 generation of a full-length sequence coding for UCP3, or to isolate 
a portion of the nucleotide sequence coding for UCP3 for use as a 
probe to screen one or more cDNA- or genomic-based libraries to 
isolate a full-length sequence encoding UCP3 or UCP3-like 
proteins. 
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In an exemplified method, the human UCP3 full length cDNA of 
the present invention was generated by a novel method of cDNA 
screening. Briefly, the extension of partial cDNA sequences have 
historically been achieved with one or both of the two commonly used 
5 methods: filter screening of cDNA libraries by hybridization with labeled 
probes, and 5'- and 3'-RACE with total cellular mRNA by PCR. The first 
method is effective but laborious and slow while the latter method is fast 
but limited in efficiency. This RACE protocol is hindered by limited 
length of extension due to the use of the entire cellular mRNA 

10 population in a single reaction. Since smaller fragments are amplified 
much more efficiently than larger fragments by PCR in the same 
reaction, PCR products obtained using the second method are often quite 
small. This portion of the present invention marks an improvement 
upon known methods of cDNA library screening by initially 

15 constructing and subdividing cDNA libraries followed by isolating 5'- 
and 3'- flanking fragments by PCR. Since each pool is unlikely to 
contain more than one clone for a given gene which is low to moderately 
expressed, competition between large and small PCR products in one 
pool does not exist, making it possible to isolate fragments of various 

20 sizes. One defininte advantage of the method disclosed in this 

specification is the efficiency, throughput, and its potential to isolate 
alternatively spliced cDNA forms. 

Therefore, this portion of the invention relates to a process for 
rapid extension of a partial cDNA sequence based on subdividing a 

25 primary cDNA library and DNA amplification by polymerase chain 
reaction (PCR). A cDNA library is constructed with cDNA primed by 
random, ohgo-dT or a combination of both random and ohgo-dT primers 
and then subdivided into pools at approximately 10,000 -20,000 clones per 
pool. Each pool is amplified separately and therefore represents an 

30 independent portion of the cDNA molecules from the original mRNA 
source. Samples from all the pools are collected and transferred into 96- 
well plates. To extend a partial cDNA sequence, such as SEQ ID NO:l or 
2, positive pools containing the partial cDNA sequence are first 
identified by PCR with a pair of primers complementary to the cDNA 

35 sequence. Each positive pool in the library contains an independent 
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clone of the cDNA sequence; within each clone are embedded the partial 
cDNA sequence and its flanking fragments. The flanking fragments 
are isolated by PCR with primers complementary to the known vector 
and cDNA sequences and then sequenced directly. The DNA sequences 
5 from these fragments plus the original partial cDNA sequence are 

assembled into a continuous fragment, resulting in the extension of the 
partial cDNA sequence and the eventual determination of its full-length 
gene sequence by repeating the process, if necessary, until a complete 
open reading frame is obtained. 

10 The fundamental principle of this process is to subdivide a 

complex library into pools of about 10,000 to about 20,000 clones. A 
library of two million primary clones, a number large enough to cover 
most mRNA transcripts expressed in the tissue, can be subdivided into 
188 pools and stored in two 96-well plates. Since the number of 

15 transcripts for most genes is fewer than one copy per -10,000 transcripts 
in total cellular mRNA, each pool is unlikely to contain more than one 
clone for a given cDNA sequence. Such reduced complexity makes it 
possible to use PCR to isolate flanking fragments of partial cDNA 
sequences larger than those obtained by known methods. 

20 The foregoing process is exemplifed for isolation and 

characterization of a full-length cDNA encoding human UCP3. Briefly, 
a random and oligo dT primed fetal brain cDNA library consisting of 
approximately 4 million primary clones was constructed in the plasmid 
vector pBluescript (Stratagene, LaJolla, CA). The primary clones were 

25 subdivided into 188 pools with each pool containing -20,000 clones. Each 
pool was amplified separately and the resulting plasmid pools were 
collected and transferred into two 96-well plates. Primer pairs from the 
5' and 3' portion of SEQ ID NO:l (EST-AA192136), 5' and 3' 
oligonucleotides from SEQ ID NO:4 (EST-AA192553) as well as 

30 oligonucleotide primers both 5' and 3' of the polylinker sequence of the 
vector (in this case, pBluescript SK") were used to scan this human fetal 
brain cDNA library distributed in a 96-well plate separately. Initial 
positive pools were identifed with the 5' and 3' primers from AA192136 
(SEQ ID NOS: 5 and 6) and the pBluescript primers (SEQ ID NOS:9 and 

35 10). The positive pools were scanned again by PCR with both pBluescript 
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primers as well as the 5' and 3' primers associated with the 3' EST, 
AA192553 (primers 3U3F and 3U3R). This step showed that scanning 
with 838F (pBSK", SEQ ID NO: 9) and 3U3F (hybridizing to the 3' EST 
AA192553; SEQ ID NO:8), generated a 2.4 Kb cDNA fragment. This 
5 cDNA fragment was subcloned into the vector pTA2.1 (Invitrogen, San 
Diego, CA, USA) by TA cloning. Four positive clones, the original PCR 
fragment, as well the I.M.A.G.E. clone #628529 (SEQ ID NOS:l and 2) 
were sequenced by primer walking. The sequences were then 
assembled into a contig of 2340 base pairs which is as set forth in SEQ ID 

10 NO: 11 and Figure 1A and Figure IB. This sequence contains an open 
reading frame that encodes a polypeptide of 312 amino acids as set forth 
in SEQ ID NO:12 and Figures 2A-2D and Figure 3. 

The cDNA as set forth in SEQ ID NO:ll was identified as a full- 
length cDNA fragment from a single positive pool. Figure 5 shows the 

15 vector and EST (gene) specific primers used to isolate the human UCP3 
gene. In this instance, the entire coding region and 3' untranslated 
region was retrieved by the 838F and 3U3F primer combinantion. 
However, the skilled artisan, aided with this specification, will 
understand the more far reaching cDNA cloning process disclosed 

20 herein: multiple primer combinations from an EST or other partial 
cDNA sequence, in combination with flanking vector primer 
oligonucleotides may be used to "walk" in both directions away from the 
internal gene specific sequence and respective primers such that a 
contig representing a full length cDNA may be constructed. This 

25 procedure relies on the ability to screen multiple pools which comprise a 
representive portion of the total cDNA library. This procedure is not 
dependent upon using a cDNA library with directionally cloned inserts. 
Instead, both 5' and 3' vector and gene specific primers are added and a 
contig map is constructed from additional screening of positive pools 

30 using both vector primers and gene specific primers. Of course, these 
gene specific primers are intially constructed from a known nucleic acid 
fragment such as an expressed sequence tag. However, as the walk 
continues, gene specific primers are utilized from the 5' and 3' 
boundaries of the newly identified regions of the cDNA. As the walk 
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continues, there is still no requirement that the vector orientation of a 
yet unidentified fragment be known. Instead, all combinations are 
tested on a positive pool and the actual vector orientation is determined 
by the ability of certain vector/gene specific primers to generate the 
5 predicted PCR fragment. A full-length cDNA may then be easily 
constructed by known subcloning procedures. 

A variety of mammalian expression vectors may be used to 
express recombinant UCP3 in mammalian cells. Expression vectors 
are defined herein as DNA sequences that are required for the 

10 transcription of cloned DNA and the translation of their mRNAs in an 
appropriate host. Such vectors can be used to express eukaryotic DNA 
in a variety of hosts such as bacteria, bluegreen algae, plant cells, insect 
cells and animal cells. Specifically designed vectors allow the shuttling 
of DNA between hosts such as bacteria-yeast or bacteria-animal cells. 

15 An appropriately constructed expression vector should contain: an 

origin of replication for autonomous replication in host cells, selectable 
markers, a limited number of useful restriction enzyme sites, a 
potential for high copy number, and active promoters. A promoter is 
defined as a DNA sequence that directs RNA polymerase to bind to DNA 

20 and initiate RNA synthesis. A strong promoter is one which causes 
mRNAs to be initiated at high frequency. Expression vectors may 
include, but are not limited to, cloning vectors, modified cloning vectors, 
specifically designed plasmids or viruses. 

Commercially available mammalian expression vectors 

25 which may be suitable for recombinant UCP3 expression, include but 
are not limited to, pcDNA3.1 (Invitrogen), pBlueBacHis2 (Invitrogen), 
pLITMUS28, pLITMUS29, pLITMUS38 and pLITMUS39 (New England 
Bioloabs), pcDNAI, pcDNAJamp (Invitrogen), pcDNA3 (Invitrogen), 
pMClneo (Stratagene), pXTl (Stratagene), pSG5 (Stratagene), EBO- 

30 pSV2-neo (ATCC 37593) pBPV-l(8-2) (ATCC 37110), pdBPV-MMTneo(342- 
12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), 
pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), and 1ZD35 (ATCC 
37565). 

A variety of bacterial expression vectors may be used to 
35 express recombinant UCP3 in bacterial cells. Commercially available 
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bacterial expression vectors which may be suitable for recombinant 
UCP3 expression include, but are not limited to pCR2.1 (Invitrogen), 
pETlla (Novagen), lambda gtll (Invitrogen), pcDNAII (Invitrogen), 
pKK223-3 (Pharmacia). 
5 A variety of fungal cell expression vectors may be used to 

express recombinant UCP3 in fungal cells. Commercially available 
fungal cell expression vectors which may be suitable for recombinant 
UCP3 expression include but are not limited to pYES2 (Invitrogen), 
Pichia expression vector (Invitrogen). 

10 A variety of insect cell expression vectors may be used to 

express recombinant receptor in insect cells. Commercially available 
insect cell expression vectors which may be suitable for recombinant 
expression of UCP3 include but are not limited to pBlueBacIII and 
pBlueBacHis2 (Invitrogen). 

15 An expression vector containing DNA encoding a UCP3- 

like protein may be used for expression of UCP3 in a recombinant host 
cell. Recombinant host cells may be prokaryotic or eukaryotic, 
including but not limited to bacteria such as E. coli, fungal cells such as 
yeast, mammalian cells including but not limited to cell lines of human, 

20 bovine, porcine, monkey and rodent origin, and insect cells including 
but not limited to Drosophila and silkworm derived cell lines. Cell lines 
derived from mammalian species which may be suitable and which are 
commercially available, include but are not limited to, L cells 
L-M(TK-) (ATCC CCL 1.3), L cells L-M (ATCC CCL 1.2), Saos-2 (ATCC 

25 HTB-85), 293 (ATCC CRL 1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 
70), COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC 
CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC 
CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and MRC-5 
(ATCC CCL 171). 

30 The expression vector may be introduced into host cells via 

any one of a number of techniques including but not limited to 
transformation, transfection, protoplast fusion, and electroporation. 
The expression vector-containing cells are individually analyzed to 
determine whether they produce UCP3 protein. Identification of UCP3 

35 expressing cells may be done by several means, including but not 
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limited to immunological reactivity with anti-UCP3 antibodies, and the 
presence of host cell-associated UCP3 activity. 

The cloned UCP3 cDNA obtained through the methods 
described above may be recombinantly expressed by molecular cloning 
5 into an expression vector (such as pcDNA3.1, pCR2.1, pBlueBacHis2 
and pLITMUS28) containing a suitable promoter and other appropriate 
transcription regulatory elements, and transferred into prokaryotic or 
eukaryotic host cells to produce recombinant UCP3. Techniques for 
such manipulations can be found described in Sambrook, et aL, supra , 

10 are discussed at length in the Example section and are well known and 
easily available to the artisan of ordinary skill in the art. 

Expression of UCP3 DNA may also be performed using in 
vitro produced synthetic mRNA. Synthetic mRNA can be efficiently 
translated in various cell-free systems, including but not limited to 

15 wheat germ extracts and reticulocyte extracts, as well as efficiently 
translated in cell based systems, including but not limited to 
microinjection into frog oocytes, with microinjection into frog oocytes 
being preferred. 

To determine the UCP3 cDNA sequence(s) that yields 

20 optimal levels of UCP3 protein, UCP3 cDNA molecules including but not 
limited to the following can be constructed: the full-length open reading 
frame of the UCP3 cDNA and various constructs containing portions of 
the cDNA encoding only specific domains of the protein or rearranged 
domains of the protein. All constructs can be designed to contain none, 

25 all or portions of the 5' and/or 3' untranslated region of UCP3. UCP3 
activity and levels of protein expression can be determined following the 
introduction, both singly and in combination, of these constructs into 
appropriate host cells. Following determination of the UCP3 cDNA 
cassette yielding optimal expression in transient assays, this UCP3 

30 cDNA construct is transferred to a variety of expression vectors 

-(including recombinant viruses), including but not limited to those for 
mammalian cells, plant cells, insect cells, oocytes, bacteria, and yeast 
cells. 

Levels of UCP3 protein in host cells is quantified by a variety 
35 of techniques including, but not limited to, immunoaffinity and/or 
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ligand affinity techniques. UCP3-specific affinity beads or UCP3-specific 
antibodies are used to isolate ^S-methioirine labeled or unlabelled UCP3 
protein. Labeled UCP3 protein is analyzed by SDS-PAGE. Unlabelled 
UCP3 protein is detected by Western blotting, ELISA or RIA assays 
5 employing UCP3 specific antibodies. 

Following expression of UCP3 in a host cell, UCP3 protein 
may be recovered to provide UCP3 in active form. Several UCP3 
purification procedures are available and suitable for use. Recombinant 
UCP3 may be purified from cell lysates and extracts, or from 

10 conditioned culture medium, by various combinations of, or individual 
application of salt fractionation, ion exchange chromatography, size 
exclusion chromatography, hydroxylapatite adsorption chromatography 
and hydrophobic interaction chromatography. 

In addition, recombinant UCP3 can be separated from 

15 other cellular proteins by use of an immuno-affinity column made with 
monoclonal or polyclonal antibodies specific for full length UCP3, or 
polypeptide fragments of UCP3. Additionally, polyclonal or monoclonal 
antibodies may be raised against a synthetic peptide (usually from about 
9 to about 25 amino acids in length) from a portion of the protein as 

20 disclosed in SEQ ID NO:12. Monospecific antibodies to UCP3 are 
purified from mammalian antisera containing antibodies reactive 
against UCP3 or are prepared as monoclonal antibodies reactive with 
UCP3 using the technique of Kohler and Milstein (1975, Nature 256: 495- 
497). Monospecific antibody as used herein is defined as a single 

25 antibody species or multiple antibody species with homogenous binding 
characteristics for UCP3. Homogenous binding as used herein refers to 
the ability of the antibody species to bind to a specific antigen or epitope, 
such as those associated with the UCP3, as described above. UCP3 
specific antibodies are raised by immunizing animals such as mice, 

30 rats, guinea pigs, rabbits, goats, horses and the like, with an 

appropriate concentration of UCP3 or UCP3 synthetic peptide either 
with or without an immune adjuvant. 

Preimmune serum is collected prior to the first 
immunization. Each animal receives between about 0.1 mg and about 

35 1000 mg of UCP3 associated with an acceptable immune adjuvant. Such 
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acceptable adjuvants include, but are not limited to, Freund's complete, 
Freund's incomplete, alum-precipitate, water in oil emulsion 
containing Corynebacterium parvum and tKNA. The initial 
immunization consists of the UCP3 protein or UCP3 synthetic peptide 
5 in, preferably, Freund's complete adjuvant at multiple sites either 

subcutaneously (SC), intraperitoneally (IP) or both. Each animal is bled 
at regular intervals, preferably weekly, to determine antibody titer. The 
animals may or may not receive booster injections following the initial 
immunization. Those animals receiving booster injections are 

10 generally given an equal amount of UCP3 in Freund's incomplete 

adjuvant by the same route. Booster injections are given at about three 
week intervals until maximal titers are obtained. At about 7 days after 
each booster immunization or about weekly after a single 
immunization, the animals are bled, the serum collected, and aliquots 

15 are stored at about -20°C. 

Monoclonal antibodies (mAb) reactive with UCP3 are 
prepared by immunizing inbred mice, preferably Balb/c, with UCP3. 
The mice are immunized by the IP or SC route with about 1 mg to about 
100 mg, preferably about 10 mg, of UCP3 in about 0.5 ml buffer or saline 

20 incorporated in an equal volume of an acceptable adjuvant, as discussed 
above. Freund's complete adjuvant is preferred. The mice receive an 
initial immunization on day 0 and are rested for about 3 to about 30 
weeks. Immunized mice are given one or more booster immunizations 
of about 1 to about 100 mg of UCP3 in a buffer solution such as phosphate 

25 buffered saline by the intravenous (IV) route. Lymphocytes, from 

antibody positive mice, preferably splenic lymphocytes, are obtained by 
removing spleens from immunized mice by standard procedures known 
in the art. Hybridoma cells are produced by mixing the splenic 
lymphocytes with an appropriate fusion partner, preferably myeloma 

30 cells, under conditions which will allow the formation of stable 
hybridomas. Fusion partners may include, but are not limited to: 
mouse myelomas P3/NSl/Ag 4-1; MPC-11; S-194 and Sp 2/0, with Sp 2/0 
being preferred. The antibody producing cells and myeloma cells are 
fused in polyethylene glycol, about 1000 mol. wt., at concentrations from 

35 about 30% to about 50%. Fused hybridoma cells are selected by growth in 
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hypoxanthine, thymidine and aminopterin supplemented Dulbecco's 
Modified Eagles Medium (DMEM) by procedures known in the art. 
Supernatant fluids are collected form growth positive wells on about 
days 14, 18, and 21 and are screened for antibody production by an 
5 immunoassay such as solid phase immunoradioassay (SPIRA) using 
UCP3 as the antigen. The culture fluids are also tested in the 
Ouchterlony precipitation assay to determine the isotype of the mAb. 
Hybridoma cells from antibody positive wells are cloned by a technique 
such as the soft agar technique of MacPherson, 1973, Soft Agar 

10 Techniques, in Tissue Culture Methods and Applications, Kruse and 
Paterson, Eds., Academic Press. 

Monoclonal antibodies are produced in vivo by injection of 
pristine primed Balb/c mice, approximately 0.5 ml per mouse, with 
about 2 x 10^ to about 6 x 10^ hybridoma cells about 4 days after priming. 

15 Ascites fluid is collected at approximately 8-12 days after cell transfer 
and the monoclonal antibodies are purified by techniques known in the 
art. 

In vitro production of anti-UCP3 mAb is carried out by 
growing the hydridoma in DMEM containing about 2% fetal calf serum 
20 to obtain sufficient quantities of the specific "mAb. The mAb are purified 
by techniques known in the art. 

Antibody titers of ascites or hybridoma culture fluids are 
determined by various serological or immunological assays which 
include, but are not limited to, precipitation, passive agglutination, 
25 enzyme-linked immunosorbent antibody (ELISA) technique and 

radioimmunoassay (RIA) techniques. Similar assays are used to detect 
the presence of UCP3 in body fluids or tissue and cell extracts. 

It is readily apparent to those skilled in the art that the 
above described methods for producing monospecific antibodies may be 
30 utilized to produce antibodies specific for UCP3 polypeptide fragments, 
or full-length UCP3 polypeptide. 

UCP3 antibody affinity columns are made by adding the 
antibodies to Affigel-10 (Biorad), a gel support which is pre-activated 
with N-hydroxysuccinimide esters such that the antibodies form 
35 covalent linkages with the agarose gel bead support. The antibodies are 
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then coupled to the gel via amide bonds with the spacer arm. The 

remaining activated esters are then quenched with 1M ethanolamine 

HC1 (pH 8). The column is washed with water followed by 0.23 M 

glycine HC1 (pH 2,6) to remove any non-conjugated antibody or 

5 extraneous protein. The column is then equilibrated in phosphate 

buffered saline (pH 7.3) and the cell culture supernatants or cell extracts 

containing UCP3 or UCP3 fragments are slowly passed through the 

column. The column is then washed with phosphate buffered saline 
until the optical density (A2gQ) fe^ 8 to background, then the protein is 

10 eluted with 0.23 M glycine-HCl (pH 2.6). The purified UCP3 protein is 
then dialyzed against phosphate buffered saline. 

The novel UCP3 of the present invention is suitable for use 
in an assay procedure for the identification of compounds which 
modulate UCP3 activity. Modulating UCP3 activity, as described herein 

15 includes the inhibition or activation of the protein and also includes 
directly or indirectly affecting the cell cycle regulatory properties 
associated with UCP3 activity. Compounds which modulate UCP3 
activity include agonists, antagonists, inhibitors, activators, and 
compounds which directly or indirectly affect regulation of the UCP3 

20 activity and/or the UCP3 uncoupling activity. 

The UCP3 of the present invention may be obtained from 
both native and recombinant sources for use in an assay procedure to 
identify UCP3 modulators. In general, an assay procedure to identify 
UCP3 modulators will contain the UCP3-protein of the present 

25 invention, and a test compound or sample which contains a putative 

UCP3 modulator. The test compounds or samples may be tested directly 
on, for example, purified UCP3 protein whether native or recombinant, 
subcellular fractions of UCP3-producing cells whether native or 
recombinant, and/or whole cells expressing the UCP3 whether native or 

30 recombinant. The test compound or sample may be added to the UCP3 
in the presence or absence of a known UCP3 modulator. The 
modulating activity of the test compound or sample may be determined 
by, for example, analyzing the ability of the test compound or sample to 
bind to UCP3 protein, activate the protein, inhibit UCP3 activity, inhibit 
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or enhance the binding of other compounds to the UCP3 protein, 
modifying receptor regulation, or modifying an intracellular activity. 

Therefore, the present invention also relates to 
subcellular membrane fractions of the recombinant host cells (both 
5 prokayotic and eukaryotic as well as both stably and transiently 
transformed cells) comprising the nucleic acids of the present 
invention. These subcellular membrane fractions will comprise 
UCP3 at levels substantially above wild type levels and hence will be 
useful in various assays described throuhgout this specification. 

10 The identification of modulators of UCP3 activity will be 

useful in treating disease states involving, as an example but not a 
limitation, obesity and diabetes, by manipulating the interrelated 
process of balancing food intake, energy expenditure and glucose 
metabolism within the patient. Therefore, modulators to treat 

15 hyperactive conditions of energy expenditure which originate in the 
mitochondria of skeletal muscle will also be within the purview of the 
present invention. 

The present invention is also directed to methods for 
screening for compounds which modulate the expression of DNA or 

20 RNA encoding a UCP3 protein of the present invention or which 

modulates the function of a such a UCP3 protein. Compounds which 
modulate these activities may be DNA, RNA, peptides, proteins, or non- 
proteinaceous organic molecules. Compounds may modulate by 
increasing or attenuating the expression of DNA or RNA encoding the 

25 UCP3 protein, or the function of a UCP3 protein. Compounds that 

modulate the expression of DNA or RNA encoding the UCP3 protein or 
the biological function thereof may be detected by a variety of assays. 
The assay may be a simple a yes/no n assay to determine whether there is 
a change in expression or function. The assay may be made 

30 quantitative by comparing the expression or function of a test sample 
with the levels of expression or function in a standard sample. Kits 
containing modified UCP3, antibodies to UCP3, or modified UCP3 
protein may be prepared by known methods for such uses. 

The DNA molecules, RNA molecules, recombinant protein 

35 and antibodies of the present invention may be vised to screen and 
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measure levels of UCP3 DNA, RNA or protein. The recombinant 
proteins, DNA molecules, RNA molecules and antibodies lend 
themselves to the formulation of kits suitable for the detection and typing 
of UCP3. Such a kit would comprise a compartmentalized carrier 
5 suitable to hold in close confinement at least one container. The carrier 
would further comprise reagents such as recombinant UCP3 protein or 
anti-UCP3 antibodies suitable for detecting UCP3. The carrier may also 
contain a means for detection such as labeled antigen or enzyme 
substrates or the like. 

10 Pharmaceutically useful compositions comprising 

modulators of UCP3 may be formulated according to known methods 
such as by the admixture of a pharmaceutically acceptable carrier. 
Examples of such carriers and methods of formulation may be found in 
Remington's Pharmaceutical Sciences. To form a pharmaceutically 

15 acceptable composition suitable for effective administration, such 
compositions will contain an effective amount of the protein, DNA, 
RNA, or modified UCP3. 

Therapeutic or diagnostic compositions of the invention are 
administered to an individual in amounts sufficient to treat or diagnose 

20 disorders. The effective amount may vary according to a variety of 

factors such as the individual's condition, weight, sex and age. Other 
factors include the mode of administration. 

The pharmaceutical compositions may be provided to the 
individual by a variety of routes such as subcutaneous, topical, oral and 

25 intr amus cul ar . 

The term "chemical derivative* describes a molecule that 
contains additional chemical moieties which are not normally a part of 
the base molecule. Such moieties may improve the solubility, half-life, 
absorption, etc. of the base molecule. Alternatively the moieties may 

30 attenuate undesirable side effects of the base molecule or decrease the 
toxicity of the base molecule. Examples of such moieties are described 
in a variety of texts, such as Remington's Pharmaceutical Sciences. 

Compounds identified according to the methods disclosed 
herein may be used alone at appropriate dosages. Alternatively, co- 
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administration or sequential administration of other agents may be 
desirable. 

The present invention also has the objective of providing 
suitable topical, oral, systemic and parenteral pharmaceutical 
5 formulations for use in the novel methods of treatment of the present 
invention. The compositions containing compounds identified 
according to this invention as the active ingredient can be administered 
in a wide variety of therapeutic dosage forms in conventional vehicles 
for administration. For example, the compounds can be administered 

10 in such oral dosage forms as tablets, capsules (each including timed 
release and sustained release formulations), pills, powders, granules, 
elixirs, tinctures, solutions, suspensions, syrups and emulsions, or by 
injection. Likewise, they may also be administered in intravenous (both 
bolus and infusion), intraperitoneal, subcutaneous, topical with or 

15 without occlusion, or intramuscular form, all using forms well known 
to those of ordinary skill in the pharmaceutical arts. 

Advantageously, compounds of the present invention may 
be administered in a single daily dose, or the total daily dosage may be 
administered in divided doses of two, three or four times daily. 

20 Furthermore, compounds for the present invention can be administered 
in intranasal form via topical use of suitable intranasal vehicles, or via 
transdermal routes, using those forms of transdermal skin patches well 
known to those of ordinary skill in that art. To be administered in the 
form of a transdermal delivery system, the dosage administration will, 

25 of course, be continuous rather than intermittent throughout the dosage 
regimen. 

For combination treatment with more than one active 
agent, where the active agents are in separate dosage formulations, the 
active agents can be administered concurrently, or they each can be 
30 administered at separately staggered times. 

The dosage regimen utilizing the compounds of the present 
invention is selected in accordance with a variety of factors including 
type, species, age, weight, sex and medical condition of the patient; the 
severity of the condition to be treated; the route of administration; the 
35 renal and hepatic function of the patient; and the particular compound 
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thereof employed. A physician or veterinarian of ordinary skill can 
readily determine and prescribe the effective amount of the drug 
required to prevent, counter or arrest the progress of the condition. 
Optimal precision in achieving concentrations of drug within the range 
5 that yields efficacy without toxicity requires a regimen based on the 
kinetics of the drug's availability to target sites. This involves a 
consideration of the distribution, equilibrium, and elimination of a 
drug. 

By computer analysis of a genomic database, molecular 
10 cloning and DNA sequencing a novel member of the human UCP3 gene 
family has been identified. This new cDNA fragment encodes a novel 
human uncoupling protein which may be useful as a gene therapy 
vehicle or a drug target in such diseases as obesity and diabetes. 

Northern hybridization experiments with RNA from 
15 various cell and tissues indicates that UCP3 is expressed in various 
human tissue, including skeletal muscle, brain, and heart, with 
predominant expression in skeletal muscle. 

The following examples are provided as illustrative of the 
present invention without, however, limiting the same thereto. 

20 

EXAMPLE 1: 

Identification of Partial DNA Fragments Encoding Human UCP3 

25 The nucleotide sequence for human UCP1 (Cassard et al., 1990, J. 

Cell. Biochem. 43: 255-264) is deposited in Genbank and is assigned 
accession number U28480. The nucleotide sequence for human UCP2 
(Fleury et al., 1997, Nature Genetics 15: 269-272) ) is deposited in 
Genbank and is assigned accession number U76367. The nucleotide 

30 sequences of UCP1 and UCP2 were used to search the Genbank EST 

sequences by Blastn (see Altschull et al., 1990, J. Mol. Biol. 215: 403-410). 
The Blastn search identified an expressed sequence tag (EST) detected 
among the Merck- Washington University ESTs with accession number 
AA192136, which is: 



-32- 



GGTGACCTACGACATCCTCAAGGAGAAGCTGCTGGACTACCACCTGCTCACTGACAACT 
TCCCCTGCCACTTTGTCTCTGCCTTTGGAGCCGGCTTC 

CCGGTGGACGTGGTGAAGACCCGGTATATGAACTCACCTCCAGGCCAGTACTTCAGCCC 
CCTCGACTGTATGATAAAGATGGTGGCCCAGGAGCGCCACCAGCCTTCTACAAGGGATT 
5 TACACCCTCCTTTTTGCGTTTC 

AGCTGAAACGGGCCCTGATGAAAGTCCAGATGTTACGGGAATCACCGTTTTGAACAAGA 
CAAGAAGGCCACTGGTAGCTAACGTGTCCGAAACCAGTTAAGAATGGAAG 

(SEQ ID NO: 1). 

An a NT residue is added at base number 249 in order to preserve the open 
10 reading frame of this EST. This corrected version of AA192136 is as 
follows: 

GGTGACCTACGACATCCTCAAGGAGAAGCTGCTGGACTACCACCTGCTCACTGACAACT 
TCCCCTGCCACTTTGTCTCTGCCTTTGGAGCCGGCTTCTC 

CCGGTGGACGTGGTGAAGACCCGGTATATGAACTCACCTCCAGGCCAGTACTTCAGCCC 
15 CCTCGACTGTATGATAAAGATGGTGGCCCAGGAGCGCCACCAGCCTTCTACAAGGGATT 
TACACCCTCCTTOTTTGCGTTTGG 

CAGCTGAAACGGGCCCTGATGAAAGTCCAGATGTTACGGGAATCACCGTTTTC 
ACAAGAAGGCCACTGGTAGCTAACGTGTCCGAAACCAGTTAAGAATGGAAG(SEQ ID 

NO:2). 

20 SEQ ID NO:l is a component of Stratagene® muscle cDNA library 

937209 and is identified as cDNA Image Clone No. 628529. This 
Stratagene muscle cDNA library was generated by standard methods 
and cDNAs are cloned unidirectionally into EcoRI/XhoI digested 
pBS(SK ) plasmid. The mRNA for this library was isolated from the 

25 skeletal muscle of an adult human patient suffering from malignant 
hyperthermia. This cDNA clone is publically available by Genbank 
Accession No. AA192136, Image Clone ID No. 628529, and Washington 
University Clone ED No. zq02d09.rl. This construct is available from 
Research Genetics, Inc., 2130 Memorial Parkway SW, Hunstville, AL 

30 35801 (http://www. resgen.com). 

SEQ ID NO:l was used as a query to search Genbank with Blastn. 
A second EST, with accession number Z28895 (and assigned Genbank 
number HSBB6C051) was identified and is disclosed throughout this 
specification as SEQ ID NO:3, which is: 
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CGAGCAGCTGAAACGGGCCCTGATGAAAGTCCAGATGTTACGGGNATCACCGTTTTGAA 

CAAGACAAGAAGGCCACTGGTAGCTAACGTNTCCGAAACCAGTTAAGANT^ 

CGGTCCATCCACGNACACATGGACACAGACCCACACATNTT (SEQ ID N0:3) . 
The EST disclosed as SEQ ID NO:2 shows an open reading frame 
5 encoding a novel protein fragment showing homology to human UCP1 

and human UCP2. SEQ ID NO:2 represents the 5' portion of the cDNA 

Image Clone No. 628529. The 3' sequence read of cDNA Image Clone 

No. 628529 has the accession number aal92553, which is as follows: 

TTTTTTTTGTTCTTACTC 
10 TTCCCGAGAATTCCGAGTCCTGCTACTTTAGGTTCTTC 

CCCCAAGCCCAACAATCCTTTGAGGTACTCATGATTGAGCGCGTGGTGGGGGGGGGTGG 

GGAAGAGGCTGCATGGGGGTGGGGCTCCTGTGGCT 

GTCCCCAAGTCTCTGGATCCTTTGGTC 

TCTTCCCTGGCAACTCCTCTCTC 
15 GTTAGATTTACACAGAGCTTTTGCTTTTATAAAGTGCGTTC 

GCATGTCATAGCACCCCTGGTGAGGTGGACAG 

GG AAAG TNGGGG (SEQ ID NO:4). 

20 EXAMPLE 2; 

Isolation and Characterization of DNA Fragments 
Encoding Full-Length Human UCP3 

The exemplified full length cDNA of the present invention was 
25 generated by a novel method of cDNA screening as described in the 
Detailed Description of the Invention. 

The foregoing process is exemplified for isolation and 
characterization of a full-length cDNA encoding human UCP3. Briefly, 
a random oligo dT primed fetal brain cDNA library consisting of 
30 approximately 4.0 million primary clones was constructed in the 
plasmid vector pBluescript (Stratagene, LaJolla, CA). The primary 
clones were subdivided into 188 pools with each pool containing 
approximately 20,000 clones. Each pool was amplified separately and 
the resulting plasmid pools were collected and transferred into two 
35 96- well plates. 
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Primer pairs 5'-AAGCTGCTGGACTACCACCTGCTC-3' (F29; 
SEQ ID N0:5), 5'-TACTGGCCTGGAGGTGAGTTCA-3' (R147: SEQ ID 
N0:6) as designed from AA192136; 

5'-CCAAGCCCAACAATCCTTTGA-3' (3U3F; SEQ ID NO:7) and 
5 5'-CCAAAGGATCCAGAGACTTGG-3' (3U3R; SEQ ID NO:8) as 
designed from AA192553, and pBluescript SK vector primers 838F 
(5'-TTGTGTGGAATTGTGAGCGGATAAC-3'; SEQ ID NO:9) and 578R 
(5'-CCAGGGTTTTCCCAGTCACGAC-3'; SEQ ID NO: 10) were used to 
scan by PCR a human fetal brain cDNA library distributed in a 96- well 

10 plate separately. Both the F29/R147-838F/578R scan and the 3U3F/3U3R- 
838F/578R scans scored positive in pools D10, E10 and F2. Gene specific 
primer 3U3F (SEQ ID NO:8, the 5' oligonucleotide from the 3' EST 
(AA192553, SEQ ID NO:4) and pBluescript SK vector primer 838F SEQ ID 
NO:9) amplified a fragment of about 2.4 kb from both pool E10 and pool 

15 F2. The fragments from E10 and F2 were gel purified, mixed, subcloned 
into the vector pTA2.1 (Invitrogen, San Diego, CA, USA) by TA cloning. 
Four positive clones, the original PCR fragment, as well the I.M.A.G.E. 
clone #628529 (SEQ ID NOS:l and 2) were sequenced by primer walking. 
The sequences were then assembled into a contig of 2340 base pairs 

20 which is as set forth in SEQ ID NO:ll and shown in Figure 1A-1B. This 
sequence contains an open reading frame that encodes a polypeptide of 
312 amino acids as set forth in SEQ ID NO:12 and shown in Figure 2A- 
2D and Figure 3. 



25 EXAMPLE 3: 

Construction of a Yeast Expression Vector and Transformed Yeast Cell 

Line Expressing Human UPC3 

A pair of primers, UCP3.-9F.RI20mer 
30 (5'CATAGAATTCCAGGACTATGGTTGGAC3', SEQ ID NO:13), and 

UCP3.1308R.XhoI20mer 
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(5T5ATTCTCGAGCTACCAGTGGCCTTCTTG3', SEQ ID N0:14) were 
used to generate a PCR product of human UCP3 tagged with EcoRI and 
Xhol sites at the 5' and 3' ends, respectively, using pCR2.1-UCP3.1 as the 
template. The PCR product was digested with EcoRI and Xhol and 
5 purified from an agarose gel. This fragment was ligated into a yeast 
expression vector pYES2 (Invitrogen, San Diego, CA) that was also 
digested with EcoRI and Xhol such that UCP3 is under the control of S. 
cerevisiae GAL1 promoter. Multiple recombinants isolated after 
transformation into E. coli were picked and sequenced using three UCP3 

10 internal primers in combinatition with two pYES2 vector primers 
(5'CCCGGATCGGACTACTAGCA, (SEQ ID NO:15); 
5'GGGGGGAGGGCGTGAATGTAA; [SEQ ID NO: 16]). Mutation-free 
clones of pYES2-UCP3 were then transformed into two Saccharomyces 
cerevisiae strains INVScl (MATa, his3Dl, leu2, trpl-289, ura3-52) and 

15 YPH499(iHA7b, ura3-52, lys2-801 9 ade2-101,trpl-D63, his3-D200, Uu2- Dl) 
using lithium acetate method and selected on Sc-ura media. The 
expression of UCP3 was achieved by induction in Sc-ura media with 2% 
galactose and 3% glycerol. 

The diploid Saccharomyces cerevisiae strain W303 (MATa /a, 

20 canl-100, ade2-10, his3-ll r 15, leu2-2 r 112, trpl~Dl, ura3-l) was also used 
for the expression of UCP3. Human UCP3 was expressed under the 
control of GAL1 promoter in pYES2 (Invitrogen, San Diego, CA). 
Constructs were made directly in yeast using a gap repair strategy. 
The entire open reading frame of human UCP3 was amplified using a 

25 pair of primers that incorporated a yeast translation start site 

(ATAATG). Taq Additive (Stratagene, La Jolla, CA) was included in 
PCR reaction to enhance amplification fidelity. The PCR product was 
fused to GAL1 at the 5* end and to CYC1 terminator at the 3' end by 
co-transforming the PCR product and pYES2 followed by selection on 

30 Sc-Ura medium. More than 50% of the transformants contained the 
expected GAL1-UCP3 plasmids. Consistent results were obtained from 
multiple independent clones for growth and flow cytometry analysis. 
For the growth assay, the W303 strain containing the vector alone or 
UCP3 expression plasmid was streaked onto Sc-Ura media with 3% 
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glycerol and 2% galactose and incubated at 30°C for 48 hours. To 
measure mitochondria membrane potential, cells were grown in Sc-Ura 
media supplemented with raffinose. Galactose was added to induce the 
expression of UCP3 for five hours. Fluorescent probe DiOC6 (3,3'- 
5 dihexyloxacarbocyanine iodine) was added after cell concentration was 
adjusted to two millions per ml. Data were collected and analyzed as 
described (Bouillaud et aL, 1994. EMBO J. 13: 1990-1997). A total 10,000 
objects were taken for the histograms. For the display of fluorescence 
intensity, a three-order logarithmic scale was used. 

10 The biochemical activity of UCP1 and UCP2 have previously been 

studied by measuring the defects of aerobic growth in yeast expressing 
UCPs. To test that UCP3 encodes an uncoupling protein for 
mitochondria, UCP3 was expressed under the control of a tightly 
inducible promoter GAL1 in the yeast system as previously described for 

15 UCP1 and UCP2 (Fleury et aL, 1997, Nature Genetics 15: 269-272; 
Gimeno et aL, 1997, Diabetes 46, 900-906). Induction with galactose 
resulted in a significant decrease in the growth rate of strains 
containing UCP3 relative to cells with vector alone. To confirm more 
directly that UCP3 reduced the mitochondria membrane potential,the 

20 fluorescent intensity of cells labeled with the potential sensitive dye 

DiOC6 was measured using flow cytometry analysis. Induction of UCP3 
expression by galactose significantly reduced the membrane potential of 
mitochondria compared with strains containing vector alone (Figure 8A 
and 8C). In addition, no membrane potential shift was observed when 

25 UCP3 expression was not induced (Figure 8B). Two membrane potential 
peaks were observed after UCP3 induction (Figure 8C). Similar results 
were observed for UCP2 (Fleury et aL, 1997, Nature Genetics 15: 269- 
272). The most likely explanation for this observation is that one peak 
represents cells with reduced membrane potential due to expression of 

30 UCP3 while the other peak represents cells with normal membrane 
potential due to the loss of UCP3 expression. When UCP2 was cloned 
into the same vector as UCP3 and transformed into the same yeast 
strain, UCP3 reduced membrane potential to a greater extent compared 
with UCP2. The data indicate that UCP3 encodes an uncoupling protein 

35 that is able to decrease the membrane potential of mitochondria. 
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EXAMPLE 4: 

Tissue Distribution Of UCP3 Expression 

5 Human multiple tissue Northern Blot #7760-1, Human 

Brain Northern Blot II #7755-1, Human Brain Northern Blot III 
#7750-1, and Human multiple tissue Northern Dot Blot were purchased 
from Clontech. The probe was made by PCR amplification of coding 
region of UCP3 (SEQ ID NO:ll). This fragment was labeled with 32 P and 

10 used to probe the Northern blot as described in the Clontech procedures, 
which are well known in the art. The blots were exposed to X-ray film 
with an intensifying screen at -70 °C for times as indicated in the figure. 
Figure 4A-4C is an autoradiograph which shows that UCP3 is highly 
expressed in skeletal muscle as a transcript of -2.4 kb (Figure 4A & B). 

15 Two minor transcripts, -6.0 kb and 7.5 kb are also expressed in skeletal 
muscle (Figure 4A). In addtion, UCP3 is also weakly expressed as a 
transcript of -2.4 kb in the heart. Probing with human beta-actin probe 
(Figure 4C) shows that all the lanes have approximately equal amount of 
total polyA RNA, indicating that detection of UCP3 only in skeletal 

20 muscle and in the heart is not due to the lack of RNA in the other tissue 
lanes. Also, In the brain, ucp3 is most abundantly expressed in the 
corpus collosum as a transcript of -2.4 kb. 

The expression pattern of UCP3 appears to be unique, because it is 
abundantly and nearly exclusively expressed in skeletal muscle. Even 

25 though UCP3 expression was also detected in the heart and brain, it was 
found at much lower levels. Skeletal muscle is responsible for more 
than half of energy expenditure at resting stage. Differences in energy 
expenditure of skeletal muscle can explain most of the variation in basal 
metabolic rate between individuals (Ravussin and Bogardus, 1992, Am J 

30 Clin Nutr 55: 242S-245S). The proton leak was shown to account for 
around one-half of the resting respiration rate of perfused rat skeletal 
muscle (Rolfe and Brand, 1996, Am J Physiol 271: C1380-1389). 
Furthermore, free fatty acids-activated uncoupling in skeletal muscle 
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was also observed previously (Brustovetsky et al., 1992, FEBS Lett 305: 15- 
17). Thus, UCP3 should provide a significant contribution to the 
uncoupling activities in skeletal muscle since it is much more strongly 
expressed than UCP1 and UCP2, and its activity is likely to be dependent 
5 on free fatty acids. 

EXAMPLE 5: 

Characterization and Chromosomal Localization of 
10 Genes Encoding UCP3 

To map the position of ucp3 in the human genome, the primer pair 
3U3F (SEQ ID NO:7) and 3U3R (SEQ ID NO:8) of ucp3 was used to carry out 
PCR reactions with the 83 clones of the Stanford radiation hybrid panel 

15 (Cox et aL, 1990, Science, 250:245:250). The PCR results were scored and 
submitted to the Stanford Genome Center for linkage analysis. It was 
found that ucp3 is located close to the marker DllS944e with a lod score of 
9.4, which corresponds to the cytogenetic location of Ilql3.1~llq23.3. 

Radiation hybrid mapping placed this gene into a region around 

20 chromosome llq2i, the same region where ucp2 is located (Fleury et al., 
1997, Nature Genetics 15: 269-272). Sequence analysis showed that UCP3 
is 72% identical to UCP2 and 56% identical to UCP1 at the amino acid 
level. Hydropathy analysis showed that UCP3 has the same 
hydrophobicity/hydrophilicity profile as UCP1 and 2. Like UCP1 and 

25 UCP2, the UCP3 protein also consists of three repeating units with each 
unit containing 2 transmembrane domains. Given this homology to 
UCP1 and UCP2, human UCP3 is predicted to encode an uncoupling 
protein. Two mouse bacterial artificial chromosome (BAG) clones 
containing mouse UCP2 were obtained using an STS in the 3' UTR of 

30 mouse UCP2* After the mouse UCP3 sequence was determined, PCR 
with mouse UCP3-specific primers scored positive with both mouse 
UCP2 BAC clones. In the mouse, the UCP2 and UCP3 genes were found 
to be in the same BAC clone suggesting that UCP3 is located close to 
UCP2 on chromosome 7. Thus, UCP2 and UCP3 are located in the same 
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chromosomal region in both human and mouse, while human UCP1 
was mapped to 4q31. The chromosomal regions where UCP2 resides are 
linked to obesity and hyperinsulinemia in both human and mouse, and 
so UCP2 was hypothesized to be a candidate gene for these diseases 
5 (Taylor and Phillips, 1996, Genomics 34, 389-398; Fleury et al., 1997, 
Nature Genetics 15: 269-272). The close location of UCP3 to UCP2 
suggests that UCP3 is also a candidate gene for obesity and 
hyperinsulinemia. In addition, the close homology and location between 
UCP2 and UCP3 suggested that the two genes arose from a gene 

10 duplication event after UCP1 was separated. 

Alignment of the six protein sequences of UCP1 to UCP3 of 
human and mouse reveals a highly clustered pattern of the conserved 
residues. The transmembrane domains and their immediate following 
sequences are much more conserved than the sequences between the 

15 transmembrane domains. Moreover, each allbnb motif following the 
first transmembrane domain of each repeat unit is perfectly conserved 
among the six proteins. Thus, UCP3 is a typical member of the 
mitochondria transport protein superfamily. UCP1 activity is inhibited 
by nucleotides but activated by free fatty acids. Affinity labeling and site- 

20 directed mutagenesis demonstrated that the region between 

transmembrane domain 5 and 6 bound to nucleotides (Mayinger and 
Klingenberg, 1992, Biochemistry 31: 10536-10543; Winkler and 
Klingenberg, 1992, EurJBiochem 203: 295-304). Particularly, deletion of 
the putative nucleotide binding motif (amino acid residue 261 to 269) 

25 renders UCP1 resistant to nucleotide-mediated inhibition (Bouillaud et 
al , 1994, EMBO J 13, 1990-1997). Substitution of Phe 267 by a Tyr also 
results in higher uncoupling activity (Bouillaud et al., id.). Both UCP2 
and UCP3 contain a Tyr instead of a Phe at this position, suggesting that 
UCP2 and UCP3 has higher basal activity. On the other hand, UCP1 

30 and UCP3 both have a Thr at the position corresponding residue 264 of 
UCP1 whereas UCP2 has an Arg. The presence of a positively charged 
residue in this motif may imply a higher affinity of UCP2 for nucleotides 
and, consequently, increased sensitivity to nucleotide-mediated 
inhibition. The very C-terminus of UCP1 was shown to be important for 

35 activation by fatty acids (Gonzalez-Barroso et al., 1996, Eur J Biochem 
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239: 445-450). Sequence alignments of the three UCP members, 
surprisingly, found that last 10 residues are not well conserved among 
each other at all, suggesting the activity of each UCP has different 
sensitivity to free fatty acids. In UCP1, change of Cys 304 to an Ala 
5 results in decreased activation while change to a Ser leads to increased 
activation by fatty acids (Gonzalez-Barroso et aL, id.). Interestingly, 
UCP2 contains an Ala whereas UCP3 contains a Ser at the Cys 304 
position, suggesting UCP3 has higher activity than UCP2. These 
comparisons suggest that UCP3 would be the most active among the 
10 three since UCP2 was shown to be more active than UCP1 under the 
same conditions (Fleury et al., 1997, Nature Genetics 15: 269-272; 
Gimeno et al., 1997, Diabetes 46: 900-906). In fact, expression of UCP3 in 
yeast reduced membrane potential more than UCP2. 

15 

EXAMPLE 6: 
Adenovirus-Mediated Leptin Expression 

All animal experiments were carried out in accordance with our 
20 institutional guidance. A total of 10 age-match female ob/ob mice were 
weighed and injected through tail vein (IV) as follows: three with El- 
deleted replication deficient adenoviral vector expressing leptin (Ad- 
leptin), three with helper-dependent adenoviral vector expressing 
leptin(HD-leptin), two with B-gal-expressing adenoviral vector(Ad-B- 
25 gal), and two were injected with buffer alone. The animals were then 
monitored for weight on a daily basis and sacrificed after one week of 
observation. Blood was collected for each animal and serum leptin 
levels were determined by Linco Research Inc. (St. Charles, MO) using 
an RIA assay. Total RNA was isolated from skeletal muscle tissue of 
30 each animal and used to prepare Northern blots. Approximately equal 
amounts of total RNA were loaded for each sample as estimated from 
the amount of ribosomal RNA in each lane. The blots were probed with 
human UCP3 and signals were quantified by phosphoimaging analysis 
(Molecular Dynamics, Sunnyvale, CA). 
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It has been reported that adenovirus-mediated leptin expression 
leads to significant reduction of body weight in ob/ob mice. Given the 
abundance of UCP3 in skeletal muscle, the effect of leptin treatment on 
UCP3 expression in skeletal muscle was tested by injecting 

5 adenoviruses expressing leptin into ob/ob mice and examining UCP3 
expression after one week. Injection of adenoviruses expressing leptin, 
either helper-dependent (HD-leptin) or first generation (Ad-leptin), led to 
significant weight loss (approximately 15%) after one week while 
injection of adenoviral vector expressing b-galactosidase (Adb-gal) had 

10 no significant effect on body weight (Figure 9A). Measurement of serum 
leptin levels at the end of one week confirmed high levels of leptin 
production in mice injected with HD-leptin (26.7 ±10 ng/ml) or Ad-leptin 
(21.8 ±16.7 ng/ml) but not in mice injected with Adb-gal (1.56 ±0.05 
ng/ml) or buffer alone (2 ±0.17 ng/ml). Wild type lean mice have leptin 

15 levels at approximately 4.5 ng/ml. UCP3 levels were then checked in 
skeletal muscle of these animals. Mice treated with adenovectors 
expressing leptin, either Ad-leptin or HD-leptin, showed approximately 
an increase of 80% and 70%, respectively, in UCP3 RNA level (Figure 9B, 
p=0.2 vs. buffer-injected mice). In contrast, mice treated with Adb-gal 

20 did not show significant change in UCP3 expression. If the average 
UCP3 level of animals showing weight loss (leptin-treated, n=6) was 
compared with that of animals showing no weight loss (b -galactosidase 
treated or untreated, n=4), the increase in UCP3 expression after leptin 
treatment was statistically significant (p=0.01). Thus, leptin treatment 

25 leads to increased UCP3 expression in skeletal muscle in ob/ob mice. 

Leptin plays a pivotal role in the regulation of food intake and 
energy expenditure. Leptin-deficient ob/ob mice are hyperphagic and 
hypometabolic, and leptin treatment led to decreased food intake and 
increased metabolic rate (Halaas et al., 1995, Science 269: 543-546, Hwa 

30 et al., 1997, Am J Physiol 272, R1204-1209; Hwa et al., 1996 Horm Metab 
Res :28: 659-663). These results suggest that UCP3 may be involved in 
the increase of metabolic rate post-leptin treatment. Leptin treatment 
led to increased expression of UCP2 in pancreatic islets and adipose 
tissue (Zhou et al., 1997). Thus, both UCP2 and UCP3 are likely to be 

35 involved in the increase of metabolic rate post-leptin treatment. 
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EXAMPLE 7: 
Isolation and Characterization of DNA Fragments 
Encoding Full-Length Mouse UCP3 

5 

Mouse UCP3 was isolated by semi-nested PCR followed by 5' and 
3' race. Three degenerate PCR primers were designed based on the 
conserved regions of the three uncoupling proteins: F441, 5'- 
CCNCTGGAYACNGCYAA-3' (SEQ ID NO:19); F755, 5'- 

10 CAGCCCACNGANGTNGT-3' (SEQ ID NO:20); and R1055, 

5'-TTCACCACRTCNACNGG-3' (SEQ ID NO:21). First round PCR was 
carried out on mouse skeletal muscle cDNA with the primer pair 
F441+R1055 . Second round PCR was carried out with the primer pair 
F755+R1055. All PCR reactions were carried out using AmpliTaq Gold 

15 (Perkin Elmer, NJ). The degerate PCR product was cloned into the 
vector pCR2.1 (Invitrogen, CA) and sequenced. Analysis of the 
sequences identified a partial cDNA sequence that was highly 
homologous to human UCP3. This partial sequence were then used to 
isolate the full-length sequence of mouse UCP3 by 5' and 3' race using a 

20 Marathon race kit from Clontech (Clontech Laboratories, CA). DNA 
sequences were all determined using the ABI Prism dye terminator 
system on ABI 377 machines. The full length cDNA encoding mouse 
UCP3 (SEQ ID NO: 17) is shown in Figure 6A and 6B and the amino acid 
sequence of mouse UCP3 (SEQ ID NO: 18) is shown in Figure 7. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Liu, Qingyun 
Chen, Fang 

(ii) TITLE OF INVENTION: Human Uncoupling Protein 3 
(iii) NUMBER OF SEQUENCES : 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merck & Co., Inc. 

(B) STREET: P.O. Box 2000 RY60-30 

(C) CITY: Rahway 

(D) STATE: NJ 

(E) COUNTRY: US 

(F) ZIP : 07065-0907 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Hand, J. Mark 

(B) REGISTRATION NUMBER: 36,545 

(C) REFERENCE /DOCKET NUMBER: 19 9 59 Y 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 732/594-3905 

(B) TELEFAX: 732/594-4720 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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10 
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20 
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30 
35 
40 
45 
50 
55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGTGACCTAC GACATCCTCA AGGAGAAGCT GCTGGACTAC CACCTGCTCA CTGACAACTT 60 

CCCCTGCCAC TTTGTCTCTG CCTTTGGAGC CGGCTTCTGT GCCACAGTGG TGGCATCCCC 120 

GGTGGACGTG GTGAAGACCC GGTATATGAA CTCACCTCCA GGCCAGTACT TCAGCCCCCT 180 

CGACTGTATG ATAAAGATGG TGGCCCAGGA GCGCCACCAG CCTTCTACAA GGGATTTACA 240 

CCCTCCTTTT TGCGTTTGGG ATCCTGGAAC GTGGTGATGT TCGTAACCTA TGAGCAGCTG 300 

AAACGGGCCC TGATGAAAGT CCAGATGTTA CGGGAATCAC CGTTTTGAAC AAGACAAGAA 360 

GGCCACTGGT AGCTAACGTG TCCGAAACCA GTTAAGAATG GAAG 404 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GGTGACCTAC GACATCCTCA AGGAGAAGCT GCTGGACTAC CACCTGCTCA CTGACAACTT 60 

CCCCTGCCAC TTTGTCTCTG CCTTTGGAGC CGGCTTCTGT GCCACAGTGG TGGCATCCCC 120 

GGTGGACGTG GTGAAGACCC GGTATATGAA CTCACCTCCA GGCCAGTACT TCAGCCCCCT 180 

CGACTGTATG ATAAAGATGG TGGCCCAGGA GCGCCACCAG CCTTCTACAA GGGATTTACA 240 

CCCTCCTTNT TTGCGTTTGG GATCCTGGAA CGTGGTGATG TTCGTAACCT ATGAGCAGCT 3 00 

GAAACGGGCC CTGATGAAAG TCCAGATGTT ACGGGAATCA CCGTTTTGAA CAAGACAAGA 360 

AGGCCACTGG TAGCTAACGT GTCCGAAACC AGTTAAGAAT GGAAG 405 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
CGAGCAGCTG AAACGGGCCC TGATGAAAGT CCAGATGTTA CGGGNATCAC CGTTTTGAAC 60 

5 

AAGACAAGAA GGCCACTGGT AGCTAACGTN TCCGAAACCA GTTAAGATTG GAAGAAAACG 120 
GTCCATCCAC GNACACATGG ACACAGACCC ACACATNTT 159 
10 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 



40 



50 



55 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

25 TTTTTTTTGT TCTTACTCCC ACACCTAAGG TGGAANTTCT TTTATTGAGT CATAATAATT 60 

TCCCGAGAAT TCCGAGTCCT GCTACTTTAG GTTCTTGCCC AGGAATCCAC CTCTTTTCCC 120 

CCAAGCCCAA CAATCCTTTG AGGTACTCAT GATTGAGCGC GTGGTGGGGG GGGGTOGGGA 180 

30 

AGAGGCTGCA TGGGGGTGGG GCTCCTGTGG CTTCACGTCA TCCACTGTCA CCTCTGGTCC 240 

CCAAGTCTCT GGATCCTTTG GTCTCACCTC TAGACAACCG GCGGGGTTCA AACCTTCTTC 300 

35 CCTGGCAACT CCTCTCTGTC CCGACAAAAT CTCTCCCAAG GCATTGTCCT TGTAGTTAGA 3 60 

TTTACACAGA GCTTTTGCTT TTATAAAGTG CGTTCATGCC CAGCTTCTCA CTTGCATGTC 420 

ATAGCACCCC TGGTGAGGTG GACAGGGAAG GGATGGCTCC CTCCATTTTG TAGGAAAGTN 480 

GGGG 484 
(2) INFORMATION FOR SEQ ID NO: 5: 



45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
AAGCTGCTGG ACTACCACCT GCTC 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleot ide H 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TACTGCCCTG GAGGTGAGTT CA 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
CCAAGCCCAA CAATCCTTTG A 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



.47- 



CCAAAGGATC CAGAGACTTG G 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
TTGTGTGGAA TTGTGAGCGG ATAAC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : s ing 1 e 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCAGGGTTTT CCCAGTCACG AC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TCGAACTCAC TCACCTCCCC TCTCACCTCA CTGCCCTCAC CAGCCAGCCT CTTGTCAAGT 
GATCAGGCTG TCAACCAACT TCTCTAGGAT AAGGTTTCAG GTCAGCCTGT GTGTATAAGA 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



CCAGTGCCAA 


GCCAGAAGCA 


GCAGAGACAA 


CAGTGAATGA 


CAAGGAGGGG 


CCATCCAATC 


180 


CCTGCTGCCA 


CCTCCTGGGA 


TGGAGCCCTA 


GGGAGCCCCT 


GTGCTGCCCC 


TGCCGTGGCA 


240 


GGACTCACAG 


CCCCACCGCT 


GCACTGAAGC 


CCAGGGCTGT 


GGAGCAGCTC 


TCTCCTTGGA 


300 


CTCCTCTCGG 


CCCTAAAGGG 


ACTGGGCAGA 


GCCTTCCAGG 


ACTATGGTTG 


GACTGAAGCC 


360 


TTCAGACGTG 


CCTCCCACCA 


TGGCTGTGAA 


GTTCCTGGGG 


GCAGGCACAG 


CAGCCTGTTT 


420 


TGCTGACCTC 


GTTACCTTTC 


CACTGGACAC 


AGCCAAGGTC 


CGCCTGC AGA 


TCCAGGGGGA 


480 


GAACCAGGCG 


GTCCAGACGG 


CCCGGCTCGT 


GCAGTACCGT 


GGCGTGCTGG 


GCACCATCCT 


540 


GACCATGGTG 


CGGACTGAGG 


GTCCCTGCAG 


CCCCTACAAT 


GGGCTGGTGG 


CCGGCCTGCA 


600 


GCGCCAGATG 


AGCTTCGCCT 


CCATCCGCAT 


CGGCCTTTAC 


GACTCCGTCA 


AGCAGGTGTA 


660 


CACCCCCAAA 


GGCGCGGACA 


ACTCCAGCCT 


CACTACCCGG 


ATTTTGGCCG 


GCTGCACCAC 


720 


AGGAGCCATG 


GCGGTGACCT 


GTGCCCAGCC 


CACAGATGTG 


GTGAAGGTCC 


GATTTCAGGC 


780 


CAGCATACAC 


CTCGGGCCAT 


CCAGGAGCGA 


CAGAAAATAC 


AGCGGGACTA 


TGGACGCCTA 


840 


CAGAACCATC 


GCCAGGGAGG 


AAGGAGTCAG 


GGGCCTGTGG 


AAAGGAACTT 


TGCCCAACAT 


900 


CATGAGGAAT 


GCTATCGTCA 


ACTGTGCTGA 


GGTGGTGACC 


TACGACATCC 


TCAAGGAGAA 


960 


GCTGCTGGAC 


TATCACCTGC 


TCACTGACAA 


CTTCCCCTGC 


CACTTTGTCT 


CTGCCTTTGG 


1020 


AGCCGGCTTC 


TGTGCCACAG 


TGGTGGCCTC 


CCCGGTGGAC 


GTGGTGAAGA 


CCCGGTATAT 


1080 


GAACTCACCT 


CCAGGCCAGT 


ACTTCAGCCC 


CCTCGACTGT 


ATGATAAAGA 


TGGTGGCCCA 


1140 


GGAGGGCCCC 


ACAGCCTTCT 


ACAAGGGATT 


TACACCCTCC 


TTTTTGCGTT 


TGGGATCCTG 


1200 


GAACGTGGTG 


ATGTTCGTAA 


CCTATGAGCA 


GCTGAAACGG 


GCCCTGATGA 


AAGTCCAGAT 


1260 


GTTACGGGAA 


TCACCGTTTT 


GAACAAGACA 


AGAAGGCCAC 


TGGTAGCTAA 


CGTGTCCGAA 


1320 


ACCAGTTAAG 


AATGGAAGAA 


AACGGTGCAT 


CCACGCACAC 


ATGGACACAG 


ACCCACACAT 


1380 


GTTTACAGAA 


CTGTTGTTTA 


CTTGTTGCTG 


ATTCAAGAAA 


CAGAAGTAGA 


AGAGAGAGGA 


1440 


TTCTGGTCTT 


CACTGCCATG 


CCTCAAGAAC 


ACCTTTGTTT 


TGCACTGACA 


AGATGGAAAA 


1500 


TAAATTATAT 


TAATTTTTGA 


AACCCATTAG 


GCATGCCTAA 


TATTTAGGCA 


AGAGAAAATA 


1560 


AACCAAGATA 


GATCCATTTG 


GACAAAATGG 


AAGGTTGGAG 


ACGTGTATCC 


CCGTGAAATC 


1620 


TGGTCAGATA 


ATGAATGATA 


AGCAGGAAGG 


ATGGCAAGCA 


CGGGACAGGA 


GGGGCCCACA 


1680 


ATGGAGTGGG 


AGATCAGCCA 


CGGAGCCTGG 


AGGGACGCCA 


CCCAGCAACA 


CAGAGCTGGC 


1740 


GACTGCAGCT 


GCACCATCAC 


ACATGCATCA 


TCAGCCTATT 


TGTAATATGT 


CTGCCACAGA 


1800 
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GAGTCCTTTG GGATTCTAGG AAACCCAAGG AACAAGAGAA AAAACTAGAG CCTGTGCTAA 1860 

AGAAGCCTGC TGGGCCCATG TGAGGCTGGG GTCGTAAATA TTCCCCGACG ACACTGAAGA 1920 

ATCAAGAGGG CAGCCCCCAC TTCTCCTACA AAATGGAGGG AGCCATCCCT TCCCTGTCCA 1980 

CCTCACCAGG GGTGCTATGA CATGCAAGTG AGAAGCTGGG CATGAACGCA CTTTATAAAA 2040 

GCAAAAGCTC TGTGTAAATC TAACTACAAG GACAATGCCT TGGGAGAGAT TTTGTCGGGA 2100 

CAGAGAGGAG TTGCCAGGGA AGAAGGTTTG AAAGATACGG TTGTCTAGAG GTGAGACCAA 2160 

AGGATCCAGA GACTTGGGGA CCAGAGGTGA CAGTGGATGA CGTGAAGCCA CAGGAGCCCC 2220 

ACCCCCATGC AGCTTTTTCC CCACCCCCCC CACCACGCGC TCAATCATGA GTACCTCAAA 2280 

GGATTGTTGG GCTTGGGGGA AAAGAGGTGG ATTCCTGGGC AAGAACCTAA AGTAGCAGGA 2340 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Val Gly Leu Lys Pro Ser Asp Val Pro Pro Thr Met Ala Val Lys 
15 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Asp Leu Val Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Gin 
35 40 45 

Ala Val Gin Thr Ala Arg Leu Val Gin Tyr Arg Gly Val Leu Gly Thr 
50 55 60 

lie Leu Thr Met Val Arg Thr Glu Gly Pro Cys Ser Pro Tyr Asn Gly 
65 70 75 80 

Leu Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser He Arg He 
85 90 95 

Gly Leu Tyr Asp Ser Val Lys Gin Val Tyr Thr Pro Lys Gly Ala Asp 
100 105 110 
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Asn Ser Ser Leu Thr Thr Arg He Leu Ala Gly Cys Thr Thr Gly Ala 
115 120 125 

Met Ala Val Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 
130 135 140 

Gin Ala Ser He His Leu Gly Pro Ser Arg Ser Asp Arg Lys Tyr Ser 
145 150 155 160 

Gly Thr Met Asp Ala Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg 
165 170 175 

Gly Leu Trp Lys Gly Thr Leu Pro Asn He Met Arg Asn Ala He Val 
180 185 190 

Asn Cys Ala Glu Val Val Thr Tyr Asp He Leu Lys Glu Lys Leu Leu 
195 200 205 

Asp Tyr His Leu Leu Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala 
210 215 220 

Phe Gly Ala Gly Phe Cys Ala Thr Val Val Ala Ser Pro Val Asp Val 
225 230 235 240 

Val Lys Thr Arg Tyr Met Asn Ser Pro Pro Gly Gin Tyr Phe Ser Pro 
245 250 255 

Leu Asp Cys Met He Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe 
260 265 270 

Tyr Lys Gly Phe Thr Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val 
275 280 * 285 

Val Met Phe Val Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val 
290 295 300 

Gin Met Leu Arg Glu Ser Pro Phe 
305 310 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CATAGAATTC CAGGACTATG GTTGGAC 
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(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CATTCTCGAG CTACCAGTGG CCTTCTTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCCGGATCGG ACTACTAGCA 
(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGGGGAGGG CGTGAATGTA A 
(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1658 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 





pp a nr* A A p A p. 

LVAvj^j/xriLrVJ' 


P AG AG A P A A P 


AGTG A ATGGT 

rtVJ X VJ X vT\J X 


GAGGCCCGGC 


CGTCAGATCC 


TCCTGCTACC 


60 




X Aft I \j<j /tAj x\j 


GA'PPPTTAGG 


VJ 1 ouLvL X UL 


ACTACCCAAC 


CTTGGCTAGA 


CGCACAGCTT 


120 


20 


CCTCCCTGAA 


CTGAAGCAAA 
p agpppp ait 


AGATTGCCAG 

PPPPGGGAPP 


GCAAGCTCTC 
ATGGTTVJGAC 

£\ X ^JVJ X X U\3rtL 


TCCTCGGACC 
TTCAGCCCTC 


TCCATAGGCA 
CGAAGTGCCT 


180 
240 




LCLALAALLKj 


1 I Vj IVa AAL» 1 1 


PP r PP/Tirv"2PPP 
LL 1\jVj\j<j\jLL 


ggp a nvireif^ 


PPTGTTTTGC 

LL X VJ X X X X wL 


GGACCTCCTC 


300 


ZD 


ACTTTTCCCC 


i\jAj AL ALLbL 


PA aPPTVPPT 
L AALtL» 1/LLvj 1 


P TViP A G A TP P 
L XVjLftVjriX LL 


A A GGGG AG AA 


CCCAGGGGCT 


360 




L ALxALtLVj ITjL 




rpp rpp p TViPp.'T 1 
IV? 1\jL IVtuu 1 


APP ATPPTGA 

rtL LA X LL X VJrt 


PTATGGTGCG 

L X X \J\J X wL\J 


CACAGAGGGT 


420 


30 


CCCCGCAGCC 

Ai ILbAAl lo 


CCTACAGCGG 


ACTGGTCGCT 

PTPTYITP A AP. 


GGCCTGCACC 
PAGTTPTAPA 

L^^w X X L X ttLn 


GCCAGATGAG 
CCCCC AAGGG 


TTTTGCCTCC 
AGCGGACCAC 


480 
540 




i LL ALtHj 1 LLr 


PP ATP AnP AT 1 
LL, A x L AL»Lr/i 1 


TPTGPP AGGP 

1 L XVj\jLrtVJVjL 


TGP APG AP AG 

X VJLrtLVJrtLrtVJ 


GAGCCATGGC 


AGTGACCTGC 


600 


35 


GCCCAGCCCA 


CGGATGTGGT 


GAAGGTCCGA 


TTTCAAGCCA 


TGATACGCCT 


GGGAACTGGA 


660 




GGAGAGAGGA 


AATACAGAGG 


GACTATGGAT 


GCCTACAGAA 


CCATCGCCAG 


GGAGGAAGGA 


720 


40 


GTCAGGGGCC 
GCTGAGATGG 


TGTGGAAAGG 
TGACCTACGA 


GACTTGGCCC 
CATCATCAAG 


AACATCACAA 
GAGAAGTTGC 


GAAATGCCAT 
TGGAGTCTCA 


TGTCAACTGT 
CCTGTTTACT 


780 
840 




GACAACTTCC 


CCTGTCACTT 


TGTCTCTGCC 


TTTGGAGCTG 


GCTTCTGTGC 


CACAGTGGTG 


900 


45 


GCCTCCCCGG 


TGGATGTGGT 


AAAGACCCGA 


TACATGAACG 


CTCCCCTAGG 


CAGGTACCGC 


960 




AGCCCTCTGC 


ACTGTATGCT 


GAAGATGGTG 


GCTCAGGAGG 


GACCCACGGC 


CTTCTACAAA 


1020 


50 


GGATTTGTGC 
GAGCAACTGA 


CCTCCTTTCT 
AGAGGGCCTT 


GCGTCTGGGA 
AATGAAAGTC 


GCTTGGAACG 
CAGGTACTGC 


TGATGATGTT 
GGGAATCTCC 


TGTAACATAT 
GTTTTGAACA 


1080 
1140 




AGGCAAGCAG 


GCTGCCTGGA 


ACAGAACAAA 


GCGTCTCTGC 


CCTGGGGACA 


CAGGCCCACA 


1200 


55 


CGGTCCAGAA 


CCCTGCACTG 


CTGCTGACAC 


GAGAAACTGA 


ACTAAAAGAG 


GAGAGTTTTA 


1260 
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GTCCTCCGTG TTTCGTCCTA AAACACCTCT GTTTTGCACT GACCTGATGG GAAATAAATT 1320 

ATATTAATTT TTAAACCCTT TCCGGTTGGA TGCCTAACAT TTAGGCAAGA GACAACAAAG 1380 

AAAACCAGAG TCAACTCCCT TGAAATGTAG GAATAAAGGA TGCATAATAA ACAGGAAAGG 1440 

CACAGGTTTT GAGAAGATCA GCCCACAGTG TTGTCCTTGA ATCAAACAAA ATGGTCGGAG 1500 

GAACCCTTCG GGTTCAGCAC AAAGAGGTGA CTACAGCCTT TTGGTCACCA GATGACTCCG 1560 

CCCCTTTGTA ATGAGTCTGC CAAGTAGACT CTATCAAGAT TCTGGGGAAA GGAGAAAGAA 1620 

CACATTGACC TGCCCGGGCG GCCGCTCGAG CCCTATGA 1658 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Val Gly Leu Gin Pro Ser Glu Val Pro Pro Thr Thr Val Val Lys 
15 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Asp Leu Leu Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Pro 
35 40 45 

Gly Ala Gin Ser Val Gin Tyr Arg Gly Val Leu Gly Thr He Leu Thr 
50 55 60 

Met Val Arg Thr Glu Gly Pro Arg Ser Pro Tyr Ser Gly Leu Val Ala 
65 70 75 80 

Gly Leu His Arg Gin Met Ser Phe Ala Ser He Arg He Gly Leu Tyr 
85 90 95 

Asp Ser Val Lys Gin Phe Tyr Thr Pro Lys Gly Ala Asp His Ser Ser 
100 105 110 

Val Ala He Arg He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val 
115 120 125 

Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Met 
130 135 140 
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lie Arg Leu Gly Thr Gly Gly Glu Arg Lys Tyr Arg Gly Thr Met Asp 
145 150 155 160 



Ala Tyr Arg Thr lie Ala Arg Glu Glu Gly Val Arg Gly Leu Trp Lys 
165 170 175 

Gly Thr Trp Pro Asn lie Thr Arg Asn Ala lie Val Asn Cys Ala Glu 
180 185 190 

Met Val Thr Tyr Asp lie lie Lys Glu Lys Leu Leu Glu Ser His Leu 
195 200 205 

Phe Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly 
210 215 220 

Phe Cys Ala Thr Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg 
225 230 235 240 

Tyr Met Asn Ala Pro Leu Gly Arg Tyr Arg Ser Pro Leu His Cys Met 
245 250 255 

Leu Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe Tyr Lys Gly Phe 
260 265 270 

Val Pro Ser Phe Leu Arg Leu Gly Ala Trp Asn Val Met Met Phe Val 
275 280 285 

Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val Gin Val Leu Arg 
290 295 300 

Glu Ser Pro Phe 
305 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "degenerate oligonucleotide* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CCNCTGGAYA CNGCYAA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "degenerate oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CAGCCCACNG ANGTNGT 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "degenerate oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTCACCACRT CNACNGG 



-56- 



WHAT IS CLAIMED IS: 

1. A purified DNA molecule encoding a human uncoupling 
protein 3 which comprises the nucleotide sequence 

5 TCGAACTCAC TCACCTCCCC TCTCACCTCA CTGCCCTCAC CAGCCAGCCT 
CTTGTCAAGT GATCAGGCTG TCAACCAACT TCTCTAGGAT AAGGTTTCAG 
GTCAGCCTGT GTGTATAAGA CCAGTGCCAA GCCAGAAGCA GCAGAGACAA 
CAGTGAATGA CAAGGAGGGG CCATCCAATC CCTGCTGCCA CCTCCTGGGA 
TGGAGCCCTA GGGAGCCCCT GTGCTGCCCC TGCCGTGGCA GGACTCACAG 

10 CCCCACCGCT GCACTGAAGC CCAGGGCTGT GGAGCAGCTC TCTCCTTGGA 
CTCCTCTCGG CCCTAAAGGG ACTGGGCAGA GCCTTCCAGG ACTATGGTTG 
GACTGAAGCC TTCAGACGTG CCTCCCACCA TGGCTGTGAA GTTCCTGGGG 
GCAGGCACAG CAGCCTGTTT TGCTGACCTC GTTACCTTTC CACTGGACAC 
AGCCAAGGTC CGCCTGCAGA TCCAGGGGGA GAACCAGGCG GTCCAGACGG 

15 CCCGGCTCGT GCAGTACCGT GGCGTGCTGG GCACCATCCT GACCATGGTG 
CGGACTGAGG GTCCCTGCAG CCCCTACAAT GGGCTGGTGG CCGGCCTGCA 
GCGCCAGATG AGCTTCGCCT CCATCCGCAT CGGCCTTTAC GACTCCGTCA 
AGCAGGTGTA CACCCCCAAA GGCGCGGACA ACTCCAGCCT CACTACCCGG 
ATTTTGGCCG GCTGCACCAC AGGAGCCATG GCGGTGACCT GTGCCCAGCC 

20 CACAGATGTG GTGAAGGTCC GATTTCAGGC CAGCATACAC CTCGGGCCAT 
CCAGGAGCGA CAGAAAATAC AGCGGGACTA TGGACGCCTA CAGAACCATC 
GCCAGGGAGG AAGGAGTCAG GGGCCTGTGG AAAGGAACTT TGCCCAACAT 
CATGAGGAAT GCTATCGTCA ACTGTGCTGA GGTGGTGACC TACGACATCC 
TCAAGGAGAA GCTGCTGGAC TATCACCTGC TCACTGACAA CTTCCCCTGC 

25 CACTTTGTCT CTGCCTTTGG AGCCGGCTTC TGTGCCACAG TGGTGGCCTC 
CCCGGTGGAC GTGGTGAAGA CCCGGTATAT GAACTCACCT CCAGGCCAGT 
ACTTCAGCCC CCTCGACTGT ATGATAAAGA TGGTGGCCCA GGAGGGCCCC 
ACAGCCTTCT ACAAGGGATT TACACCCTCC TTTTTGCGTT TGGGATCCTG 
GAACGTGGTG ATGTTCGTAA CCTATGAGCA GCTGAAACGG GCCCTGATGA 

30 AAGTCCAGAT GTTACGGGAA TCACCGTTTT GAACAAGACA AGAAGGCCAC 
TGGTAGCTAA CGTGTCCGAA ACCAGTTAAG AATGGAAGAA AACGGTGCAT 
CCACGCACAC ATGGACACAG ACCCACACAT GTTTACAGAA CTGTTGTTTA 
CTTGTTGCTG ATTCAAGAAA CAGAAGTAGA AGAGAGAGGA TTCTGGTCTT 
CACTGCCATG CCTCAAGAAC ACCTTTGTTT TGCACTGACA AGATGGAAAA 

35 TAAATTATAT TAATTTTTGA AACCCATTAG GCATGCCTAA TATTTAGGCA 
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AGAGAAAATA AACCAAGATA GATCCATTTG GACAAAATGG AAGGTTGGAG 
ACGTGTATCC CCGTGAAATC TGGTCAGATA ATGAATGATA AGCAGGAAGG 
ATGGCAAGCA CGGGACAGGA GGGGCCCACA ATGGAGTGGG AGATCAGCCA 
CGGAGCCTGG AGGGACGCCA CCCAGCAACA CAGAGCTGGC GACTGCAGCT 

5 GCACCATCAC ACATGCATCA TCAGCCTATT TGTAATATGT CTGCCACAGA 
GAGTCCTTTG GGATTCTAGG AAACCCAAGG AACAAGAGAA AAAACTAGAG 
CCTGTGCTAA AGAAGCCTGC TGGGCCCATG TGAGGCTGGG GTCGTAAATA 
TTCCCCGACG ACACTGAAGA ATCAAGAGGG CAGCCCCCAC TTCTCCTACA 
AAATGGAGGG AGCCATCCCT TCCCTGTCCA CCTCACCAGG GGTGCTATGA 

10 CATGCAAGTG AGAAGCTGGG CATGAACGCA CTTTATAAAA GCAAAAGCTC 
TGTGTAAATC TAACTACAAG GACAATGCCT TGGGAGAGAT TTTGTCGGGA 
CAGAGAGGAG TTGCCAGGGA AGAAGGTTTG AAAGATACGG TTGTCTAGAG 
GTGAGACCAA AGGATCCAGA GACTTGGGGA CCAGAGGTGA CAGTGGATGA 
CGTGAAGCCA CAGGAGCCCC ACCCCCATGC AGCTTTTTCC CCACCCCCCC 

15 CACCACGCGC TCAATCATGA GTACCTCAAA GGATTGTTGG GCTTGGGGGA 
AAAGAGGTGG ATTCCTGGGC AAGAACCTAA AGTAGCAGGA, 

disclosed as SEQ ID NO: 11. 

2. A DNA molecule of claim 1 which comprises from 
20 about nucleotide 344 to about nucleotide 1282 of SEQ ID NO:ll. 

3. A purified DNA molecule encoding human 
uncoupling protein 3 wherein said DNA molecule encodes a protein 
comprising the amino acid sequence 

25 mvglkpsdvpptmavkflgagtaacfadlvtf pldtakvrlqi qgenqavqtarlvqyr 
gvlgti ltmvrtegpcs pynglvaglqrqmsfas iriglydsvkqvytpkgadnssltt 
ri lagcttgamavtcaqptdwkvrfqas ihlgpsrsdrkysgtmdayrti areegvrg 
lwkgtlpnimpjstaiwcaevvtydilkeklldyhlltdnfpchfvsafgagfcatvvas 
pvdwktrymnsppgqyfsplix:mikmvaqegpt^ 

30 QLKRALMKVQMLRESPF, as set forth in three-letter abbreviation in SEQ ID 
NO:12. 

4. An expression vector for the expression of a human 
uncoupling 3 protein in a recombinant host cell wherein said expression 

35 vector comprises the DNA molecule of claim 1. 
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5. An expression vector of claim 4 which is a eukaryotic 
expression vector, 

5 6. An expression vector of claim 4 which is a 

prokaryotic expression vector. 

7. A host cell which expresses a recombinant human 
uncoupling 3 protein wherein said host cell contains the expression 

10 vector of claim 4. 

8. A host cell which expresses a recombinant human 
uncoupling 3 protein wherein said host cell contains the expression 
vector of claim 5. 

15 

9. A host cell which expresses a recombinant human 
uncoupling protein 3 wherein said host cell contains the expression 
vector of claim 6. 

20 10. A host cell of claim 7 wherein said human 

uncoupling protein 3 is overexpressed from said expression vector. 

11. A host cell of claim 8 wherein said human 
uncoupling protein 3 is overexpressed from said expression vector. 

25 

12. A host cell of claim 9 wherein said human 
uncoupling protein 3 is overexpressed from said expression vector. 

13. A subcellular membrane fraction obtained from the 
30 host cell of claim 10 which contains recombinant human uncoupling 

protein 3. 

14. A subcellular membrane fraction obtained from the 
host cell of claim 11 which contains recombinant human uncoupling 

35 protein 3 . 
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15. A subcellular membrane fraction obtained from the 
host cell of claim 12 which contains recombinant human uncoupling 
protein 3. 

5 16. A purified DNA molecule which consists of the 

nucleotide sequence 

TCGAACTCAC TCACCTCCCC TCTCACCTCA CTGCCCTCAC CAGCCAGCCT 
CTTGTCAAGT GATCAGGCTG TCAACCAACT TCTCTAGGAT AAGGTTTCAG 
GTCAGCCTGT GTGTATAAGA CCAGTGCCAA GCCAGAAGCA GCAGAGACAA 

10 CAGTGAATGA CAAGGAGGGG CCATCCAATC CCTGCTGCCA CCTCCTGGGA 
TGGAGCCCTA GGGAGCCCCT GTGCTGCCCC TGCCGTGGCA GGACTCACAG 
CCCCACCGCT GCACTGAAGC CCAGGGCTGT GGAGCAGCTC TCTCCTTGGA 
CTCCTCTCGG CCCTAAAGGG ACTGGGCAGA GCCTTCCAGG ACTATGGTTG 
GACTGAAGCC TTCAGACGTG CCTCCCACCA TGGCTGTGAA GTTCCTGGGG 

15 GCAGGCACAG CAGCCTGTTT TGCTGACCTC GTTACCTTTC CACTGGACAC 
AGCCAAGGTC CGCCTGCAGA TCCAGGGGGA GAACCAGGCG GTCCAGACGG 
CCCGGCTCGT GCAGTACCGT GGCGTGCTGG GCACCATCCT GACCATGGTG 
CGGACTGAGG GTCCCTGCAG CCCCTACAAT GGGCTGGTGG CCGGCCTGCA 
GCGCCAGATG AGCTTCGCCT CCATCCGCAT CGGCCTTTAC GACTCCGTCA 

20 AGCAGGTGTA CACCCCCAAA GGCGCGGACA ACTCCAGCCT CACTACCCGG 
ATTTTGGCCG GCTGCACCAC AGGAGCCATG GCGGTGACCT GTGCCCAGCC 
CACAGATGTG GTGAAGGTCC GATTTCAGGC CAGCATACAC CTCGGGCCAT 
CCAGGAGCGA CAGAAAATAC AGCGGGACTA TGGACGCCTA CAGAACCATC 
GCCAGGGAGG AAGGAGTCAG GGGCCTGTGG AAAGGAACTT TGCCCAACAT 

25 CATGAGGAAT GCTATCGTCA ACTGTGCTGA GGTGGTGACC TACGACATCC 
TCAAGGAGAA GCTGCTGGAC TATCACCTGC TCACTGACAA CTTCCCCTGC 
CACTTTGTCT CTGCCTTTGG AGCCGGCTTC TGTGCCACAG TGGTGGCCTC 
CCCGGTGGAC GTGGTGAAGA CCCGGTATAT GAACTCACCT CCAGGCCAGT 
ACTTCAGCCC CCTCGACTGT ATGATAAAGA TGGTGGCCCA GGAGGGCCCC 

30 ACAGCCTTCT ACAAGGGATT TACACCCTCC TTTTTGCGTT TGGGATCCTG 
GAACGTGGTG ATGTTCGTAA CCTATGAGCA GCTGAAACGG GCCCTGATGA 
AAGTCCAGAT GTTACGGGAA TCACCGTTTT GAACAAGACA AGAAGGCCAC 
TGGTAGCTAA CGTGTCCGAA ACCAGTTAAG AATGGAAGAA AACGGTGCAT 
CCACGCACAC ATGGACACAG ACCCACACAT GTTTACAGAA CTGTTGTTTA 

35 CTTGTTGCTG ATTCAAGAAA CAGAAGTAGA AGAGAGAGGA TTCTGGTCTT 
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CACTGCCATG CCTCAAGAAC ACCTTTGTTT TGCACTGACA AGATGGAAAA 
TAAATTATAT TAATTTTTGA AACCCATTAG GCATGCCTAA TATTTAGGCA 
AGAGAAAATA AACCAAGATA GATCCATTTG GACAAAATGG AAGGTTGGAG 
ACGTGTATCC CCGTGAAATC TGGTCAGATA ATGAATGATA AGCAGGAAGG 
5 ATGGCAAGCA CGGGACAGGA GGGGCCCACA ATGGAGTGGG AGATCAGCCA 
CGGAGCCTGG AGGGACGCCA CCCAGCAACA CAGAGCTGGC GACTGCAGCT 
GCACCATCAC ACATGCATCA TCAGCCTATT TGTAATATGT CTGCCACAGA 
GAGTCCTTTG GGATTCTAGG AAACCCAAGG AACAAGAGAA AAAACTAGAG 
CCTGTGCTAA AGAAGCCTGC TGGGCCCATG TGAGGCTGGG GTCGTAAATA 

10 TTCCCCGACG ACACTGAAGA ATCAAGAGGG CAGCCCCCAC TTCTCCTACA 
AAATGGAGGG AGCCATCCCT TCCCTGTCCA CCTCACCAGG GGTGCTATGA 
CATGCAAGTG AGAAGCTGGG CATGAACGCA CTTTATAAAA GCAAAAGCTC 
TGTGTAAATC TAACTACAAG GACAATGCCT TGGGAGAGAT TTTGTCGGGA 
CAGAGAGGAG TTGCCAGGGA AGAAGGTTTG AAAGATACGG TTGTCTAGAG 

15 GTGAGACCAA AGGATCCAGA GACTTGGGGA CCAGAGGTGA CAGTGGATGA 
CGTGAAGCCA CAGGAGCCCC ACCCCCATGC AGCTTTTTCC CCACCCCCCC 
CACCACGCGC TCAATCATGA GTACCTCAAA GGATTGTTGG GCTTGGGGGA 
AAAGAGGTGG ATTCCTGGGC AAGAACCTAA AGTAGC AGGA , 
disclosed as SEQ ID NO: 11. 

20 

17. A purified DNA molecule encoding a human 
uncoupling protein 3 wherein said DNA molecule encodes a protein 
consisting of the amino acid sequence 

mvglkpsdvpptmavkflgagtaacfadlvtfpldtakvrlqiqgenqavqtarlvqyr 
25 gvlgtiltmvrtegpcspynglvaglqrqmsfasiriglydsvkqvytpkgadnssltt 
rilagcttgamavtcaqptdwkvrfqasihlg 

lwkgtlpnimrnaiwcaevvtydilkeklldyhlltdnfpchfvsafgagfcatvvas 
pvdvvktrymnsppgqyfsplix:mikmvaqegptafykgftp 
qlkralmkvqmlres pf , 
30 as set forth in three-letter abbreviation in SEQ ID NO:12. 

18. A process for the expression of a human uncoupling 
protein 3 in a recombinant host cell, comprising: 
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(a) transfecting the expression vector of claim 4 into 
a suitable host cell; and, 

(b) culturing the host cells of step (a) under 

5 conditions which allow expression of the human uncoupling protein 
from the expression vector. 

19. An expression vector for the expression of a human 
uncoupling protein 3 in a recombinant host cell wherein said expression 

10 vector comprises the DNA molecule of claim 16. 

20. A purified DNA molecule encoding a mouse 
uncoupling protein which comprises the nucleotide sequence 

CCAGGAACAG CAGAGACAAC AGTGAATGGT GAGGCCCGGC CGTCAGATCC 

15 TGCTGCTACC TAATGGAGTG GATCCTTAGG GTGGCCCTGC ACTACCCAAC 
CTTGGCTAGA CGCACAGCTT CCTCCCTGAA CTGAAGCAAA AGATTGCCAG 
GCAAGCTCTC TCCTCGGACC TCCATAGGCA GCAAAGGAAC CAGGCCCATT 
CCCCGGGACC ATGGTTGGAC TTCAGCCCTC CGAAGTGCCT CCCACAACGG 
TTGTGAAGTT CCTGGGGGCC GGCACTGCGG CCTGTTTTGC GGACCTCCTC 

20 ACTTTTCCCC TGGACACCGC CAAGGTCCGT CTGCAGATCC AAGGGGAGAA 
CCCAGGGGCT CAGAGCGTGC AGTACCGCGG TGTGCTGGGT ACCATCCTGA 
CTATGGTGCG CACAGAGGGT CCCCGCAGCC CCTACAGCGG ACTGGTCGCT 
GGCCTGCACC GCCAGATGAG TTTTGCCTCC ATTCGAATTG GCCTCTACGA 
CTCTGTCAAG CAGTTCTACA CCCCCAAGGG AGCGGACCAC TCCAGCGTCG 

25 CCATCAGGAT TCTGGCAGGC TGCACGACAG GAGCCATGGC AGTGACCTGC 
GCCCAGCCCA CGGATGTGGT GAAGGTCCGA TTTCAAGCCA TGATACGCCT 
GGGAACTGGA GGAGAGAGGA AATACAGAGG GACTATGGAT GCCTACAGAA 
CCATCGCCAG GGAGGAAGGA GTCAGGGGCC TGTGGAAAGG GACTTGGCCC 
AACATCACAA GAAATGCCAT TGTCAACTGT GCTGAGATGG TGACCTACGA 

30 CATCATCAAG GAGAAGTTGC TGGAGTCTCA CCTGTTTACT GACAACTTCC 
CCTGTCACTT TGTCTCTGCC TTTGGAGCTG GCTTCTGTGC CACAGTGGTG 
GCCTCCCCGG TGGATGTGGT AAAGACCCGA TACATGAACG CTCCCCTAGG 
CAGGTACCGC AGCCCTCTGC ACTGTATGCT GAAGATGGTG GCTCAGGAGG 
GACCCACGGC CTTCTACAAA GGATTTGTGC CCTCCTTTCT GCGTCTGGGA 

35 GCTTGGAACG TGATGATGTT TGTAACATAT GAGCAACTGA AGAGGGCCTT 
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AATGAAAGTC CAGGTACTGC GGGAATCTCC GTTTTGAACA AGGCAAGCAG 
GCTGCCTGGA ACAGAACAAA GCGTCTCTGC CCTGGGGACA CAGGCCCACA 
CGGTCCAGAA CCCTGCACTG CTGCTGACAC GAGAAACTGA ACTAAAAGAG 
GAGAGTTTTA GTCCTCCGTG TTTCGTCCTA AAACACCTCT GTTTTGCACT 
5 GACCTGATGG GAAATAAATT ATATTAATTT TTAAACCCTT TCCGGTTGGA 
TGCCTAACAT TTAGGCAAGA GACAACAAAG AAAACCAGAG TCAACTCCCT 
TGAAATGTAG GAATAAAGGA TGCATAATAA ACAGGAAAGG CACAGGTTTT 
GAGAAGATCA GCCCACAGTG TTGTCCTTGA ATCAAACAAA ATGGTCGGAG 
GAACCCTTCG GGTTCAGCAC AAAGAGGTGA CTACAGCCTT TTGGTCACCA 
10 GATGACTCCG CCCCTTTGTA ATGAGTCTGC CAAGTAGACT CTATCAAGAT 
TCTGGGGAAA GGAGAAAGAA CACATTGACC TGCCCGGGCG GCCGCTCGAG 

CCCTATGA, disclosed as SEQ ID NO: 17. 

21. A DNA molecule of claim 20 which comprises from 
15 about nucleotide 211 to about nucleotide 1137 of SEQ ID NO:17. 

22. A purified DNA molecule encoding mouse 
uncoupling protein 3 wherein said DNA molecule encodes a protein 
comprising the amino acid sequence, MVGLQPSEVP PTTWKFLGA 

20 GTAACFADLL TFPLDTAKVR LQIQGENPGA QSVQYRGVLG TILTMVRTEG 
PRSPYSGLVA GLHRQMSFAS IRIGLYDSVK QFYTPKGADH SSVAIRILAG 
CTTGAMAVTC AQPTDWKVR FQAMIRLGTG GERKYRGTMD AYRTIAREEG 
VRGLWKGTWP NITRNAIVNC AEMVTYDIIK EKLLESHLFT DNFPCHFVSA 
FGAGFCATW ASPVDWKTR YMNAPLGRYR SPLHCMLKMV AQEGPTAFYK 

25 GFVPSFLRLG AWNVMMFVTY EQLKRALMKV QVLRESPF* as set forth in 

three-letter abbreviation in SEQ ID NO: 18. 

23. An expression vector for the expression of a mouse 
uncoupling protein 3 in a recombinant host cell wherein said expression 

30 vector comprises the DNA molecule of claim 20. 

24. An expression vector of claim 23 which is a 
eukaryotic expression vector. 
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25. An expression vector of claim 23 which is a 
prokaryotic expression vector. 

26. A host cell which expresses a recombinant mouse 
5 uncoupling protein 3 wherein said host cell contains the expression 

vector of claim 23. 

27. A host cell which expresses a recombinant mouse 
uncoupling 3 protein wherein said host cell contains the expression 

10 vector of claim 24. 

28. A host cell which expresses a recombinant mouse 
uncoupling protein 3 wherein said host cell contains the expression 
vector of claim 25. 

15 

29. A host cell of claim 26 wherein said mouse 
uncoupling protein 3 is overexpressed from said expression vector. 

30. A host cell of claim 27 wherein said mouse 

20 uncoupling protein 3 is overexpressed from said expression vector. 

31. A host cell of claim 28 wherein said mouse 
uncoupling protein 3 is overexpressed from said expression vector. 

25 32. A subcellular membrane fraction obtained from the 

host cell of claim 29 which contains recombinant mouse uncoupling 
protein 3. 

33. A subcellular membrane fraction obtained from the 
30 host cell of claim 30 which contains recombinant mouse uncoupling 

protein 3. 

34. A subcellular membrane fraction obtained from the 
host cell of claim 31 which contains recombinant mouse uncoupling 

35 protein 3. 
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35. A purified DNA molecule which consists of the 
nucleotide sequence, 

CCAGGAACAG CAGAGACAAC AGTGAATGGT GAGGCCCGGC CGTCAGATCC 
5 TGCTGCTACC TAATGGAGTG GATCCTTAGG GTGGCCCTGC ACTACCCAAC 
CTTGGCTAGA CGCACAGCTT CCTCCCTGAA CTGAAGCAAA AGATTGCCAG 
GCAAGCTCTC TCCTCGGACC TCCATAGGCA GCAAAGGAAC CAGGCCCATT 
CCCCGGGACC ATGGTTGGAC TTCAGCCCTC CGAAGTGCCT CCCACAACGG 
TTGTGAAGTT CCTGGGGGCC GGCACTGCGG CCTGTTTTGC GGACCTCCTC 

10 ACTTTTCCCC TGGACACCGC CAAGGTCCGT CTGCAGATCC AAGGGGAGAA 
CCCAGGGGCT CAGAGCGTGC AGTACCGCGG TGTGCTGGGT ACCATCCTGA 
CTATGGTGCG CACAGAGGGT CCCCGCAGCC CCTACAGCGG ACTGGTCGCT 
GGCCTGCACC GCCAGATGAG TTTTGCCTCC ATTCGAATTG GCCTCTACGA 
CTCTGTCAAG CAGTTCTACA CCCCCAAGGG AGCGGACCAC TCCAGCGTCG 

15 CCATCAGGAT TCTGGCAGGC TGCACGACAG GAGCCATGGC AGTGACCTGC 
GCCCAGCCCA CGGATGTGGT GAAGGTCCGA TTTCAAGCCA TGATACGCCT 
GGGAACTGGA GGAGAGAGGA AATACAGAGG GACTATGGAT GCCTACAGAA 
CCATCGCCAG GGAGGAAGGA GTCAGGGGCC TGTGGAAAGG GACTTGGCCC 
AACATCACAA GAAATGCCAT TGTCAACTGT GCTGAGATGG TGACCTACGA 

20 CATCATCAAG GAGAAGTTGC TGGAGTCTCA CCTGTTTACT GACAACTTCC 
CCTGTCACTT TGTCTCTGCC TTTGGAGCTG GCTTCTGTGC CACAGTGGTG 
GCCTCCCCGG TGGATGTGGT AAAGACCCGA TACATGAACG CTCCCCTAGG 
CAGGTACCGC AGCCCTCTGC ACTGTATGCT GAAGATGGTG GCTCAGGAGG 
GACCCACGGC CTTCTACAAA GGATTTGTGC CCTCCTTTCT GCGTCTGGGA 

25 GCTTGGAACG TGATGATGTT TGTAACATAT GAGCAACTGA AGAGGGCCTT 
AATGAAAGTC CAGGTACTGC GGGAATCTCC GTTTTGAACA AGGCAAGCAG 
GCTGCCTGGA ACAGAACAAA GCGTCTCTGC CCTGGGGACA CAGGCCCACA 
CGGTCCAGAA CCCTGCACTG CTGCTGACAC GAGAAACTGA ACTAAAAGAG 
GAGAGTTTTA GTCCTCCGTG TTTCGTCCTA AAACACCTCT GTTTTGCACT 

30 GACCTGATGG GAAATAAATT ATATTAATTT TTAAACCCTT TCCGGTTGGA 
TGCC TAAC AT TTAGGCAAGA GACAACAAAG AAAACCAGAG TCAACTCCCT 
TGAAATGTAG GAATAAAGGA TGCATAATAA ACAGGAAAGG CACAGGTTTT 
GAGAAGATCA GCCCACAGTG TTGTCCTTGA ATCAAACAAA ATGGTCGGAG 
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GAACCCTTCG GGTTCAGCAC AAAGAGGTGA CTACAGCCTT TTGGTCACCA 
GATGACTCCG CCCCTTTGTA ATGAGTCTGC CAAGTAGACT CTATCAAGAT 
TCTGGGGAAA GGAGAAAGAA CACATTGACC TGCCCGGGCG GCCGCTCGAG 
CCCTATGA, disclosed as SEQ ID NO:17 . 

5 

36* A purified DNA molecule encoding mouse 
uncoupling protein 3 wherein said DNA molecule encodes a protein 
consists of the amino acid sequence MVGLQPSEVP PTTWKFLGA 
GTAACFADLL TFPLDTAKVR LQIQGENPGA QSVQYRGVLG TILTMVRTEG 

10 PRSPYSGLVA GLHRQMSFAS IRIGLYDSVK QFYTPKGADH SSVAIRILAG 
CTTGAMAVTC AQPTDWKVR FQAMIRLGTG GERKYRGTMD AYRTIAREEG 
VRGLWKGTWP NITRNAIVNC AEMVTYDIIK EKLLESHLFT DNFPCHFVSA 
FGAGFCATW ASPVDWKTR YMNAPLGRYR SPLHCMLKMV AQEGPTAFYK 
GFVPSFLRLG AWNVMMFVTY EQLKRALMKV QVLRESPF*, asset forth in 

15 three-letter abbreviation in SEQ ID NO:18. 

37. A process for the expression of a mouse uncoupling 
protein 3 in a recombinant host cell, comprising: 

20 (a) transfecting the expression vector of claim 23 into 

a suitable host cell; and, 

(b) culturing the host cells of step (a) under 
conditions which allow expression of the human uncoupling protein 
25 3from the expression vector. 



38. An expression vector for the expression of a mouse 
uncoupling protein 3 in a recombinant host cell wherein said expression 
vector comprises the DNA molecule of claim 35. 
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39. A method of identifying a modulator of uncoupling 
protein 3 activity, which comprises: 

(a) combining a modulator of uncoupling protein 3 
activity with the uncoupling protein 3 or a biologically active fragment 

5 thereof; and, 

(b) measuring the effect of the modulator on the 
activity of ucoupling protein 3. 

40. The method of claim 39 wherein said uncoupling 
10 protein 3 is human uncoupling protein 3. 

41. The method of claim 40 wherein said human 
uncoupling protein 3 is disclosed as SEQ ID NO:12. 

15 42. The method of claim 39 wherein said uncoupling 

protein 3 is mouse uncoupling protein 3. 

43. The method of claim 42 wherein said mouse 
uncoupling protein 3 is disclosed as SEQ ID NO: 18* 

20 

44. A method of extending at least one partial 
cDNA sequence for the purpose of characterizing and isolating a full- 
length cDNA molecule, which comprises: 

25 a) constructing a cDNA library in a DNA vector primed 

by random, oligo-dT or a combination of both random and oligo-dT 
primers; 

b) subdividing the cDNA library into a plurality of 
30 cDNA pools, each of the cDNA pools containing from about 10,000 to 

about 20,000 cDNA molecules; 

c) amplifying each cDNA pool; 
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d) hybridizing oligonucleotide primers complentary to 
the 5' and S > portion of the partial cDNA sequence and to the 5' and 3' 
flanking region of the DNA vector; 

5 e) identifying each cDNA molecule which contains a 

flunking DNA fragment generated by PCR in each positive cDNA pool; 

f) sequencing the flanking DNA fragments; and, 

10 g) assembling the partial cDNA sequence and the 

sequence from the flanking DNA fragment(s) into a complete open 
reading frame. 



15 
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TITLE OF THE INVENTION 

HUMAN UNCOUPLING PROTEIN 3 

5 ABSTRACT OF THE DISCLOSURE 

An isolated nucleic acid molecule is disclosed which 
encodes a novel human uncoupling protein (UCP3), and related 
recombinant expression vectors, recombinant host cells and 
substantially purified forms of the UCP3 protein. The isolated nucleic 

10 acid and proteins disclosed herein will be useful in methods associated 
with identifying compounds which modulate energy expenditure and 
body weight regulation, as well as effecting diseases such as obesity and 
diabetes as well as mitochondria-associated hypermetabolism. 
Additionally, the isolated nucleic acids of the present ivention are 

15 candidates for gene therapy of a mammalian host, including humans. 



20 
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figure 1A 



1 


TCGAACTCAC 


TCACCTCCCC 


TCTCACCTCA 


CTGCCCTCAC 


CAGCCAGCCT 


51 


CTTGTCAAGT 


GATCAGGCTG 


TCAACCAACT 


TCTCTAGGAT AAGGTTTCAG 


101 


GTCAGCCTGT 


GTGTATAAGA 


CCAGTGCCAA 


GCCAGAAGCA GCAGAGACAA 


151 


CAG7GAATGA 


CAAGGAGGGC 


CCATCCAATC 


CCTGCTGCCA CCTCCTGGGA 


201 


TGGAGCCCTA 


GGGAGCCCCT 


GTGCTGCCCC 


TGCCGTGGCA GGACTCACAG 


251 


CCCCACCGCT 


GCACTGAAGC 


CCAGGGCTCT 


GGAGCAGCTC TCTCCTTGGA 


301 


CTCCTCTCGG 


CCCTAAAGGG 


ACTGGGCAGA 


GCCTTCCAGG ACTATGGTTG 


351 


GACTGAAGCC 


TTCAGACCTG 


CCTCCCACCA 


TCGCTGTGAA 


CTTCCTGGGG 


4 CI 


GCAGGCACAG 


CAGCCTGTTT 


TGCTGACCTC 


CTTACcnrc 


CACTGGACAC 


451 


AGCCAAGGTC 


CGCCTGCAGA 


TCCAGGSGGA 


GAACCASGCG 


CTCCAGACGG 


IZl 


CCCGGCTCGT 


GCAGTACCG7 


GGCGTGCTGG 


GCACCATCC7 


GACCATGGTG 


111 


CGGACTGAGG 


GTCCCTGCA5 


CCCCTACAAT 


GGGCTGGTGG 


CCGGCG7GCA 


601 


GCGCCAGATG 


AGCTTCGCC7 


CCATCCGCAT 


CGGCCTTTAC 


GACTCCG7CA 


€51 


AGCAGGTGTA 


CACCCCCAAA 


GGCGCGGACA 


ACTCCAGCCT 


CAC7ACCCG3 


•701 


Av# vi'GG C CG 


gctgcacca: 


AGGASCCATG 


GCGGTGACCT 


GTGCCCAGCC 


751 


CACAGATGTG 


GTGAAGG?CC 


GMTTCAGSC 


CAGCATACAC 


CTCGGGCCAT 


SCI 


CCAGGAGCGA 


CAGAAAATAC 


AGCGGGACTA 


TGGACGCCTA 


CAGAACCATC 


851 


GCCAGGGAGG 


AAGGAGTCAG 


GGGCCTGTGG 


AAAGGAACTT TGCCCAACAT 


501 


CATGAGGAAT 


GCTATCGTCA 


ACTGTGwTGA 


GGTGGTGACC TACGACATCC 


S51 


TCAAGGAGAA 


GCTGCTGGAC 


TATCACCTGC 


TCACTGACAA 


CTTCCCCTGC 


1001 


CACTTTGTCT 


CTGCCTTTGG 


AGCCGGCTTC 


TGTGCCACAG TGGTGGCCTC 


1051 


CCCGGTGGAC 


GTGG7GAAGA 


CCCGGTATAT 


GAACTCACCT CCAGGCCAGT 


1101 


ACTTCAGCCC 


CCTCGACTGT 


ATGATAAAGA 


TGGTGGCCCA GGAGGGCCCC 


1151 


ACASCCTTCT 


ACAAGGGATT 


TACACCCTCC 


TTTTTGCGTT TGGGATCCTG 


1201 


GAACGTGGTG 


ATGTTCGTAA 


CCTATGAGCA 


CCTGAAACGG GCCCTGATGA 


1251 


AAGTCCASA? 


GTTACGGGAA 


TCACCGTITT 


SAACAAGACA AGAAGGCCAC 


1301 


TGSTASCZAA 


CGTCTCCGAA 


ACCACTTAAG 


AATGGAAGAA XACGGTGCAT 
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Figure IB 



1351 


CCACGCACAC 


ATGGACACAG 


ACCCACACAT 


GTTTACAGAA CTGTTGTT7A 


1401 


CWUTTSCTS 


A7TCAAGAAA 


CAGAA5TAGA 


AGAGAGAGGA TTCTGGTCT7 


1<51 


CACTGCCATG 


CCTCAAGAAC 


ACCTTTGTTT 


TCCACTGACA AGATGGAAAA 


1501 


TAAATTATAT 


TAATTTTTGA 


AACCCATTAG 


GCATGCCTAA TATTTAGGCA 


1551 


AGAGAAAATA 


AACCAAGA7A 


GATCCATTTG 


GACAAAATGG AA6G7TGGAG 


1601 


ACGTGTATCC 


CCGTGAAATC 


TGGTCAGATA 


ATGAATGATA AGCAGGAAGG 


1651 


ATCGCAAGCA 


CGGGACAGGA 


GGGGCCCACA 


ATGGAGTGGG AGATCAGCCA 


1701 


CGGAGCCTSG 


AGGGACGCCA 


CCCAGCAACA 


CAGAGCTGGC GAC7GCAGCT 


1151 


GCACCATCAC 


ACATGCATCA 


TCAGCCTATT 


TGTAATATGT CTGCCACAGA 


IB 01 


GAGTCCTTTG 


GGATTC7AGG 


AAACCCAAGG 


AACAAGAGAA AAAAC7AGAG 


1B51 


CCTG75CTAA 


AGAA5CCTGC 


TGGGCCCATG 


TGAGGCTGGG GTCGTAAATA 


1901 


TTCCCCGACG 


ACACTGAAGA 


ATCAAGAGGG 


CAGCCCCCAC TTCTCC7ACA 


1951 


AAATOGAGGG 


AGCCATCCC7 


TCCCTCTCCA 


CCTCACCAGG GGTGCTATGA 


2CC1 


CATGCAA57G 


AGAAGCTG5G 


CATGAACGCA 


CT77A7AAAA GCAAAAGCTC 


2051 


TGTGTAAATC 


TAACTACAAG 


GACAATGCCT 


TGGGAGAGA? TT7C7CGG2A 


2101 


CAGAGAGGAG 


TTGCCAGGGA 


AGAAGGTTTG 


AAAGATACGG TTG7C7AGA5 


2151 


GTGAGACCAA 


AGGATCCAGA 


GACTTGGGGA 


CCAGAGGTGA CAGTGGATGA 


2201 


CGTGAAGCCA 


CAGGAGCCCC 


ACCCCCATGC 


AGCTTTTTCC CCACCCCCCC 


2251 


CACCACGCGC 


TCAATCATGA 


GTACCTCAAA 


GGATTGTTGG GCTTGGGGGA 


2301 


AAASASGTGG 


ATTCCTGGGC 


AAGAACCTAA 


AGTAGCAGGA (S£<3 IP WC i<) 
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TCGAACTCACTCACCTeeCCTCTCACCTCAeTiraCTCAC^^ 

1 A0CTTOSTeAC7W8QeGA»CTeSACT^^ 

6ATCA6CCTGTCAACCAACTTCTCTAGGATAAGG1TTCAGGTCA0CCTGTGTGTATAAGA 

61 

CTAGTCCGACAGTTOGTrt^GAGATCCTATrC^ 

CCASTGCCAAGCCAGAAGCAGCAGAGACAACAGTGAATGACAA6GAGOGWCATCCAATC 
O^ C WWWWTlCOTCU i VKafen O TCfceWAeTOTTCCTCCCCOCTACCTrAg 



181 



241 



301 



CCT0CTGCCACCTCCTCeGATeGAGeCCTA8G«AGCXXCTGTGCTG^ 

ra^CGG^G^CCCTACCTC<^TCCCTCG<^ 

OGACTCACyvCCCCCACCGCTCCACTCAAWCCAWCTCTGGACCASCTCTW 

CC^STW<^T<Wr<^CGTGACTrCOOGTCCCGACACCTCCTC 

CTCCTCTCGCCCCTAAAGGGACTGGGCAGAGCCTTCCAGGACTATGCTTGGArTGAAGCC 

6AW»5CC^^CC^CCC5TCTC8SAAS0TCCTSATACCAACCTGACTTCCG 

M V G L K P 

TrCAGACCTGCCTCCCACCATWCTCTGAAGTrCCTGG^ 

3 6 1 AACrcrcACGGAGGGTOTACCGlcACTT^ 

SCVPPTMAVKriGAGTAACF 

TGrTCACCTCGrTACCrnCCACTOGACACAGCCMGGTCCGCCTGCAGAWCAW^A 

<21 A c^C^GCAATGGAAAGG^CCTGTGTCGGTTCCAWrG» 

AJ5LVTFPl.DTAKVFL02CC£ 



60 
120 
160 
240 

3c: 

3€0 



CAACCAGGCGOTCCAGACOGCCCGGCTCGTGCAGTACCGTWCGTGCTGCGCACCATCCi 

481 



4t: 



bit 



CTTOGTCCOCCAGGTCTGCe<^CGAOCAC«^T<»CACC<^CGACCCGTCWTAGGA 
NOAVCTARtVQYRCVlGTJt 

6ACCAT6CTGCGGACTGAGGGTCCCTGCAGCCCCTACAATGWCTGCTGCCCW ^ 
T«VRTEO>CS»»»6X.VACtO 

acoccAflAWMewcocCTecMeeocifl^^ 
601 ^^g^aJ^aag^otag^ac^^^ 

C^CCCCOJU^XOCOCWCMCTCCAGC^ 



660 



"720 
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AOGA^CXTGCCGGTGACCTCTGCCCAGCCCACAGATCTGCTGAXCWTCCGXTrTCAGGC 

721 ♦ 4 * * * ♦ 

TCCTC<^ACCC^CACTC^CACOWTCCWTCTCTACACCACT^ 
CAHAVTCAQFTDVVXVRFQA 

CACCATACACCTCOGCCCATCCAO^GCGACACSAAAATACACCOGGACTA 

761 ♦ * — * * 4 8 * C 

CTCCTATCTQGAOCCCOCTAOU'J ^ 
SJHl,CPS*SX>RKYSGTiet>AY 

CAGAACOtfCGCCAGGGAGGAAC^CTCAt^^ 
•41 + * 4 900 

*TIA*EEGV*C1,WKGTLPW3 

CATGAGGAATC^ATCCTCAACTCTCCTCyiSCr^^ 

501 960 

GTACTCCTTACGATA&CAGTTGACACGACTCCAeCACTGGATGeTGW 

KRNAIVNCAEVVTYD2 LKEK 

GCTGCTGGACTATCACCTGC7CACTGACAAC7TCCCCTGCCACTTTCT 

SU 1020 

CGACGACC7GATAGTGGACGAOTGA CT5TT&AAGGGGACGGTGAAACAGAGACGGAAACC 
tLDYHLLTDNrPCKFVSAFC 

ACCCGGCTrCTCTGCCACAGTGGTOCCCTCCCCMTGGACCT&CTGAAGACCCGGTATAT 

ao2i 1D£D 

TCCWCCGAAC^CACGCTGTCACCACCGGAG^ 
ACrCATVVASPVCVVKTRYK 

CAACTCACCTCCAGGCCACTACTTCAOCCCCCTCCACTCTATCATAAAGATGGTCCCCCA 

CTTGAGTGC^CCTCCGGTCATaAAGTC0GGWACCT(^CATAr7ATTTCTACCACCCCGT 
KSPFGQYPSPtSCWlKMVAQ 

©BAGQ«CCCACA©CCTrCTACAA©eGAT^ 

12 DO 

CCTWCGGCrnnCGGAACATGTTCCCTAAATC^^ 
rCFTAFYKCrTPSrLRl-CSW 

GAACCTWTGATCnTCCnAAOT^ 

^ _ 1260 

^2 Cl 

CTTGCACCACTACAAGCATTCGATACTCG^^ 
KVV«FVTY«OX'K* A1 ' IJKV015 



CTTACOGGAATCACCCTTrTOAACAAGA 

L K I I M ' (Sett »0 Nw ; It) 
ArCACTTAAMaTOGAACXXX* CQgTQCATCC»COCACACAT«UCXCAaXCCCACACAT 

" 21 ^^tctIccttcttw^ 



1320 



1360 
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(nT7ACAGAACTCTTO?T7ACTTCT7GCTGATTCAAGAAACAGAAG?AGAAGAGAGAGGA 

isei * 1<<c 

CAAATGTCTT^CAACAAA7KlAACAACGACTAAGTICTrTGTCTrCATCTTCTCTCTCCT 

WC roG TCTr C ACTCCCXTOCCTCAAaAXCAC C J ' ii^JiiJ ij CACTSACAASATOCAAAA 

1441 * 1500 

AACACCASAACraCOCTACOOA l, i ' Jt WCT fiC AAACAAAACGTSACTGTTCTACCTTTT 

TAAATTATATTAATnTroAAACCCATIASOCATCCCTAATATTTAOOCAAWSXAAATA 

1501 « * * I^I^I'* 

ATTTAATATAATTAAAAACTTTGOCTAATCCGTACCGATrATAAATCCGTTCTCTTTTAT 

XACCAA^TAWTCCATTT<3SACAA>ATOGAA30TT0GASACCT0TATCCCCCTCAAATC 

1st: ♦ ie2D 

TTOCTTCTATCTAOGTAAACCTCT7TTACCTTCCAACCTCTOCACATA50GCCACTTTAC 
T«rraGATAATGAAraTAA6C*G<^G<^Ta;^ 

i62i * ::::: 1€£: 

XCCAGTCTATTACTTACTATTCCTCCTTCCTACCCTTCOTCCCCTCTCCTCCCCOCGTGT 
ATOGACTOCGASATCAOCCACCCACrCTC^K^CWCACCCACCAACACACAGCTCCC 
1661 TACCTCACCCTCTAGTC9GTCCCTCGGACCTCCCTGCGGTGGGTCGTTGTGTCTCGACCG 



is:: 



GArTGCAGCTCCACCATCACACATGCATCATCAGCCTATTTGTAATATGTCTGCCACAGA 
1 c^CGTCGACGTGCTAGTCTGTACGTAGTAGTCGGATAAACATTATACAGACGGTGTCT 
CAGTCCTTTGOGATTCTAGCAAACCCAAGCAACAAGAGAAAAAACTAGAGCCTGTGCTAA 

CTCAGGAAACCCTAAG>iTCCTTTGG5TTCL l i l J\ji <i L 'J L 'J J T J 1 1 UATCTC5GACACGAT7 
AGAAGCCTGCTG^CCCATGTGAGCrTCGGOTCOTAAATATTCCCCGACGACACTCAAGA 
TCTTCGGAC^CCCGGOTACACTCCGACCCCAGCATTTATAAGGGGCTGCTGTGACTTCT 
^7>CA>SAGOGCAGCCCCCACTTCTCCTACAA>ATG^SGGA5CCATCCCTTCCCTCTCCA 

ccTCACCATOCxnwTXTax» D4 0 

1981 • ******* 

GGAGTGGTCCCCACGATACTCTACCTTCACTCTTCCACCCCTACTT0CCT6AAATATTTT 

OCAAJU>GCTCTGTG7AAATCTAACTACAA0GACAAlVCCnQGflAGAGA'J^'i^^TCQGGA 

2041 CCTTT^»^^AGATro AWm C1 0 TIACOG^ 
C^CACXaUGTTGCCAOXyjlSAAKrriTGAA^ 

tfJUUC CT CAAtuiiLLLi JU iUUUU H. i i J C 7 ATCCCAACXSATCTCCACTCTCCTT 
AOGATCCAGAGACTTOG<JGACCAGA0CTGACAffroGATG>CGTGAA9CCACAGCAGCCCC 
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Figure 2D 

21il -» 4 4 4 4 * 22to 

TCCTAGGTCTCTGAfcCCCCTSGTCTCCAC^ 

ACrCCCATGCAg C r7^77 C CCCACCCCrCCCACCACC»CGCTCXXTCAT&AGTACCTCW^ 

2221 4 « * 4 27i: 

TCGGGGTACGTCGJWUU^OGGCTGOGGGGGGTGCTCCGCWGTTAGTACTCATGGAGTTT 

GGATTCTTGGGCTT(5GGGGAAAA£A5C7X5GAT7W 

22B1 4 4 4 4 4 23<C 

CCTAACAACCCGAACCCCCTTTTCTCCACCTAAGGACCCGTTCTTWATT^ 



4 
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1 KVGLKPSDVP PTKAVXFLGA GTAACFADLV TFPLDTAXVF. LQ2QGENQAV 

51 QTAKLVQYRG VLGTILTWVR TEGPCSPYNG LVAGLORQKS FASIR2GLYD 

201 SVKQVYTPXG XDNSSLTTRI LAGCTTGAKA VTCAQPTDW KVRFQASIKL 

151 GPSRSDRXYS GUOAYRTIA RTEGVRGLWX GTLPKXMRNA TVNCAEWTY 

201 DILKEKLLDY JJLLTDNFPCK FV&AFGAGFC ATWASPVDV VXTRYHNSPP 

251 GQYFSPLDCM IKKVAQEGPT AFYKGFTPSF LRLGSWNWM FVTYEQLXRA 

301 UCWQ!£*£S PF* (S<£4 \V (O^.jz) 



8/14 
figure 



• Pancreas 

^ • Kidney 2 

w c ^^^9H|EX * Skeletal Muscle © 

¥ n mm ■ k Liver gj. 

| 3 • Lung G 

•• Placenta gj 

Brain J^. 

Heart ^ 

9 



— k> a «j\s MW(kb) 
^ ^ ± U i» 



R, " 

iB 31 O 

♦ Pancreas 

^ . Kidney C 

B c a • Skeletal Muscle O 

SfQ 'Liver £ 
5 w * Lung 

• Placenta x 
v • Brain *© 
N Hean O 

MW(kb) £ 

O 



£8 



I = 

" IS 



— tO ^ s] C 

• *~ • • • 

J* A u> v> 



• Brain 

• Heart 

MW(kb) 



9 



% Pancreas g 

c .Kidney ~ 

3 Q f £ . Skeletal Muscle J3 

£ . liver $ 

*l f .Lung § 

I - Placenta & 



9/14 
FIGURE 5 




10/14 



1 


CCAGGAACAG 


CAGAGACAAC 


AGTGAATGGT 


GAGGCCCGGC 


CGTCAGATCC 


51 


TGCTGCTACC 


TAATGGAGTG 


GATCCTTAGG 


GTGGCCCTGC 


ACTACCCAAC 


101 


CTTGGCTAGA 


CGCACAGCTT 


CCTCCCTGAA 


CTGAAGCAAA 


AGATTGCCAG 


151 


GCAAGCTCTC 


TCCTCGGACC 


TCCATAGGCA 


GCAAAGGAAC 


CAGGCCCATT 


201 


CCCCGGGACC 


ATGGTTGGAC 


TTCAGCCCTC 


CGAAGTGCCT 


CCCACAACGG 


251 


TTGTGAAGTT 


CCTGGGGGCC 


GGCACTGCGG 


CCTGTTTTGC 


GGACCTCCTC 


301 


ACTTTTCCCC 


TGGACACCGC 


CAAGGTCCGT 


CTGCAGATCC 


AAGGGGAGAA 


351 


CCCAGGGGCT 


CAGAGCGTGC 


AGTACCGCGG 


TGTGCTGGGT 


ACCATCCTGA 


401 


CTATGGTGCG 


CACAGAGGGT 


CCCCGCAGCC 


CCTACAGCGG 


ACTGGTCGCT 


451 
*ji 


GGCCTGCACC 


GCCAGATGAG 


TTTTGCCTCC 


ATTCGAATTG 


GCCTCTACGA 


501 


CTCTGTCAAG 


CAGTTCTACA 


CCCCCAAGGG 


AGCGGACCAC 


TCCAGCGTCG 


551 


CCATCAGGAT 


TCTGGCAGGC 


TGCACGACAG 


GAGCCATGGC 


AGTGACCTGC 


6C1 


GCCCAGCCCA 


CGGATGTGGT 


GAAGGTCCGA 


TTTCAAGCCA 


TGATACGCCT 


651 


GGGAACTGGA 


GGAGAGAGGA 


AATACAGAGG 


GACTATGGAT 


rirCTACAGAA 


701 


CCATCGCCAG 


GGAGGAAGGA 


GTCAGGGGCC 


TGTGGAAAGG 


GACTTGGCCC 


751 


AACATCACAA 


GAAATGCCAT 


TGTCAACTGT 


GCTGAGATGG 


TGACCTACGA 


801 


CATCATCAAG 


GAGAAGTTGC 


TGGAGTCTCA 


CCTGTTTACT 


GACAACTTCC 


851 


CCTGTCACTT 


TGTCTCTGCC 


TTTGGAGCTG 


GCTTCTGTGC 


CACAGTGGTG 


901 


GCCTCCCCGG TGGATGTGGT AAAGACCCGA TACATGAACG 


CTCCCCTAGG 


951 


CAGGTACCGC 


AGCCCTCTGC 


ACTGTATGCT 


GAAGATGGTG 


GCTCAGGAGG 



FIGURE 6A 
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1001 


GACCCACGGC 


CTTCTACAAA 


GGATTTGTGC 


CCTCCTTTCT GCGTCTGGGA 


1051 


GCTTGGAACG 


TGATGATGTT 


TGTAACATAT GAGCAACTGA AGAGGGCCTT 


1101 


AATGAAAGTC 


CAGGTACTGC 


GGGAATCTCC 


GTTTTGAACA AGGCAAGCAG 


1151 


GCTGCCTGGA 


ACAGAACAAA 


GCGTCTCTGC 


CCTGGGGACA CAGGCCCACA 


1201 


CGGTCCAGAA 


CCCTGCACTG 


CTGCTGACAC 


GAGAAACTGA ACTAAAAGAG 


1251 


GAGAGTTTTA 


GTCCTCCGTG 


TTTCGTCCTA 


AAACACCTCT GTTTTGCACT 


1301 


GACCTGATGG 


GAAATAAATT ATATTAATTT TTAAACCCTT TCCGGTTGGA 


1351 


TGCCTAACAT 


TTAGGCAAGA 


GACAACAAAG 


AAAACCAGAG TCAACTCCCT 


1401 


TGAAATGTAG 


GAATAAAGGA 


TGCATAATAA 


ACAGGAAAGG CACAGGTTTT 


1451 


GAGAAGATCA 


GCCCACAGTG 


TTGTCCTTGA 


ATCAAACAAA ATGGTCGGAG 


1501 


GAACCCTTCG 


GGTTCAGCAC 


AAAGAGGTGA 


CTACAGCCTT TTGGTCACCA 


1551 


GATGACTCCG 


CCCCTTTGTA 


ATGAGTCTGC 


CAAGTAGACT CTATCAAGAT 


1601 


TCTGGGG AAA 


GGAGAAAGAA 


CACATTGACC 


TGCCCGGGCG GCCGCTCGAG 


1651 


CCCTATGA (SEQ ID NO: 17) 
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1 MVGLQPSEVP PTCWKFLGA GTAACFADLL TTPLDTAKVR LQIQGENPGA 

51 QSVQYRGVLG TILTMVRTEG PRSPYSGLVA GLHRQMSFAS IRIGLYDSVK 

101 QFYTPKGADH SSVAIRILAG CTTGAMAVTC AQPTDWKVR FQAMIRLGTG 

151 GERKYRGTWD AYRTIAREEG VRGLVJKGTWP NITRNAIVNC AEMVTYD1IK 

201 EKLLESHLFT DNFPCHFVSA FGAGFCATW ASPVDWKTR YMNAPLGRYR 

251 SPLHCMLKMV AQEGPTAFYK GFVPSFLRLG AWNVMMFVTY EQLKRALMKV 

301 QVLRESPF* (SEQ ID NO: 18) 
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FIGURE 9B 
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the specification of which [X] is attached hereto; 

I | was filed on as Application 

Serial No. and was amended 

through ( if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including 
the claims, as amended as indicated above. 

I acknowledge the duty to disclose to the Patent and Trademark Office all information known to me to be material 
to the patentability of this application in accordance with Title 37, Code of Federal Regulations, §1 .56. 

Foreign Priority 

I hereby claim foreign priority benefits under Title 35, United States Code, §119 (a)-(d) of any foreign application(s) 
for patent or inventor's certificate listed below and have also identified below any foreign application for patent or 
inventor's certificate for the same invention having a filing date before that of the application on which priority is claimed: 



Prior Foreign Application(s) Priority Claimed 











Country 


Number 


Date Filed 


Attorney Docket 



□ □ 











Country 


Number 


Date Filed 


Attorney Docket 



□ □ 
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Prior United States Filing 

! hereby claim the benefit under Title 35, United States Code, §1 19(e) of any United States provisional 
application(s) listed below. 



60/047,179 


May 20, 1997 


19959PV 


Appln. Ser. No. 


Filing Date 


Attorney Docket 


60/069,141 


December 9, 1997 


19959PV2 


Appln. Ser. No. 


Filing Date 


Attorney Docket 








Appln. Ser. No, 


Filing Date 


Attorney Docket 








Appln. Ser. No. 


Filing Date 


Attorney Docket 



I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) listed 
below and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior United States 
application(s) in the manner provided by the first paragraph of Title 35, United States Code, §1 12, 1 acknowledge the 
duty to disclose material information as defined in Title 37, Code of Federal Regulations, §1 .56 which occurred 
0 between the filing date(s) of the prior application(s) and the national or PCT international filing date of this application: 















Appln. Ser. No. 


Filing Date 


Status 


Attorney Docket 














Appln. Ser. No. 


Filing Date 


Status 


Attorney Docket 














Appln. Ser. No. 


Filing Date 


Status 


Attorney Docket 














Appln. Ser. No. 


Filing Date 


Status 


Attorney Docket 














Appln. Ser. No. 


Filing Date 


Status 


Attorney Docket 














Appln. Ser. No. 


Filing Date 


Status 


Attorney Docket 



And I hereby appoint 

J. MARK HAND JACK L TRIBBLE 

Reg. No. 36,545 Reg. No. 32,633 Reg. No. 

respectively and individually, as my attorneys or agents with full power of substitution and revocation, to prosecute this 
application and to transact all business in the Patent and Trademark Office connected therewith. Please address all 
communications to: 

J. MARK HAND 

Patent Department Telephone No. (732) 594- 3905 

Merck & Co., Inc. 
P.O. Box 2000 

Rahway, NJ 07065-0907 p age 2 of 3 
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I hereby declare further that all statements made herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements were made with the knowledge 
that willful false statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of 
Title 1 8 of the United States Code and that such willful false statements may jeopardize the validity of the application or 
any patent issuing thereon. 



Full name of sole or 
joint inventor 


OINGYUN LIU 


Inventor's Signature 




Date 




Residence 


North Wales, PA 19454 


Citizenship 


China 


Post Office Address 
(if different from 
above) 


P.O. Box 2000 
Rahway, N.J. 07065 



Full name of joint 
inventor 


FANG CHEN 


Inventor's Signature 




Date 




Residence 


North Wales, PA 19454 


Citizenship 


U.S. 


Post Office Address 
(if different from 
above) 


P.O. Box 2000 
Rahway, N.J. 07065 



Full name of joint 
inventor 




Inventor's Signature 




Date 




Residence 




Citizenship 




Post Office Address 
(if different from 
above) 





Full name of joint 
inventor 




Inventor's Signature 




Date 




Residence 




Citizenship 




Post Office Address 
(if different from 
above) 
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